WO 2005/096804 



PCT/US2004/007182 



SEQUENCE LISTING 



<110> Lanahan, Mike 

<120> Self -processing Plants and Plant Parts 

<130> 109846.317 

<140> US 60/315,281 
< 1 4 1 > 2001-08-27 

<160> 112 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 436 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 1 







by o 


i yr 


T i i 

JjC U 


r.l n 


T.pn 

i_J *J 


Glu 


Glu 

W U- N-4 


Gl v 


Gl v 


Val 


He 


Met 


Gin 


Ala 


1 




5 










10 










15 




Phe 


Tyr 


Trp 


Asp 


Val 


Pro 


Ser 


Gly 


Gly 


He 


Trp 


Trp 


Asp 


Thr 


lie 


Arg 




20 










25 










30 






Gin 


Lys 


lie 


Pro 


Glu 


Trp 


Tyr 


Asp 


Ala 


Gly 


He 


Ser 


Ala 


He 


Trp 


lie 




35 










40 










45 








Pro 


Pro 
50 


Ala 


Ser 


Lys 


Gly 


Met 
55 


Ser 


Gly 


Gly 


Tyr 


Ser 
60 


Met 


Gly 


Tyr 


Asp 


Pro 


Tyr 


Asp 


Tyr 


Phe 


Asp 


Leu 


Gly 


Glu 


Tyr 


Tyr 


Gin 


Lys 


Gly 


Thr 


Val 


65 








70 










75 










80 


Glu 


Thr 


Arg 


Phe 


Gly 


Ser 


Lys 


Gin 


Glu 


Leu 


He 


Asn 


Met 


lie 


Asn 


Thr 








85 










90 










95 




Ala 


His 


Ala 


Tyr 
100 


Gly 


He 


Lys 


Val 


He 
105 


Ala 


Asp 


lie 


Val 


lie 
110 


Asn 


His 


Arg 


Ala 


Gly 


Gly 


Asp 


Leu 


Glu 


Trp 


Asn 


Pro 


Phe 


Val 


Gly 


Asp 


Tyr 


Thr 




115 










120 










125 








Trp 


Thr 


Asp 


Phe 


Ser 


Lys 


Val 


Ala 


Ser 


Gly 


Lys 


Tyr 


Thr 


Ala 


Asn 


Tyr 


130 








135 










140 










Leu 


Asp 


Phe 


His 


Pro 


Asn 


Glu 


Leu 


His 


Ala 


Gly 


Asp 


Ser 


Gly 


Thr 


Phe 


145 








150 










155 










160 


Gly 


Gly 


Tyr 


Pro 


Asp 


He 


Cys 


His 


Asp 


Lys 


Ser 


Trp 


Asp 


Gin 


Tyr 


Trp 






165 










170 










175 




Leu 


Trp 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 


lie 


Gly 






180 










185 










190 






lie 


Asp 


Ala 


Trp 


Arg 


Phe 


Asp 


Tyr 


Val 


Lys 


Gly 


Tyr 


Gly 


Ala 


Trp 


Val 




195 










200 










205 








Val 


Lys 
210 


Asp 


Trp 


Leu 


Asn 


Trp 
215 


Trp 


Gly 


Gly 


Trp 


Ala 
220 


Val 


Gly 


Glu 


Tyr 


Trp 


Asp 


Thr 


Asn 


Val 


Asp 


Ala 


Leu 


Leu 


Asn 


Trp 


Ala 


Tyr 


Ser 


Ser 


Gly 


225 








230 










235 










240 


Ala 


Lys 


Val 


Phe 


Asp 


Phe 


Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 


Ala 


Ala 


Phe 








245 










250 










255 




Asp 


Asn 


Lys 


Asn 


lie 


Pro 


Ala 


Leu 


Val 


Glu 


Ala 


Leu 


Lys 


Asn 


Gly 


Gly 
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260 










265 










270 






Thr 


Val 


Val 

275 


Ser 


Arg 


Asp 


Pro 


Phe 
280 


Lys 


Ala 


Val 


Thr 


Phe 
285 


Val 


Ala 


Asn 


His 


Asp 


Thr 


Asp 


He 


He 


Trp 


Asn 


Lys 


Tyr 


Pro 


Ala 


Tyr 


Ala 


Phe 


He 




290 










295 










300 








Leu 


Thr 


Tyr 


Glu 


Gly 


Gin 


Pro 


Thr 


He 


Phe 


Tyr 


Arg 


Asp 


Tyr 


Glu 


Glu 


305 










310 










315 










320 


Trp 


Leu 


Asn 


Lys 


Asp 
325 


Lys 


Leu 


Lys 


Asn 


Leu 
330 


lie 


Trp 


He 


His 


Asp 
335 


Asn 


Leu 


Ala 


Gly 


Gly 
340 


Ser 


Thr 


Ser 


He 


Val 
345 


Tyr 


Tyr 


Asp 


Ser 


Asp 
350 


Glu 


Met 


lie 


Phe 


Val 

355 


Arg 


Asn 


Gly 


Tyr 


Gly 
360 


Ser 


Lys 


Pro 


Gly 


Leu 
365 


He 


Thr 


Tyr 


He 


Asn 


Leu 


Gly 


Ser 


Ser 


Lys 


Val 


Gly 


Arg 


Trp 


Val 


Tyr 


Val 


Pro 


Lys 




370 










375 










380 








Phe 


Ala 


Gly 


Ala 


Cys 


He 


His 


Glu 


T yr 


Thr 


Gl v 


Asn 


Le u 


Gl v 


Gl v 


Trn 


385 










390 










395 










400 


Val 


Asp 


Lys 


Tyr 


Val 
405 


Tyr 


Ser 


Ser 


Gly 


Trp 
410 


Val 


Tyr 


Leu 


Glu 


Ala 
4 15 


Pro 


Ala 


Tyr 


Asp 


Pro 
420 


Ala 


Asn 


Gly 


Gin 


Tyr 
425 


Gly 


Tyr 


Ser 


Val 


Trp 
430 


Ser 


Tyr 


Cys 


Gly 


Val 
435 


Gly 



























<210> 2 
<211> 1308 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 2 

atggccaagt acctggagct ggaggagggc ggcgtgatca tgcaggcgtt ctactgggac 60 
gtcccgagcg gaggcatctg gtgggacacc atccgccaga agatccccga gtggtacgac 120 
gccggcatct ccgcgatctg gataccgcca gcttccaagg gcatgtccgg gggctactcg 180 
atgggctacg acccgtacga ctacttcgac ctcggcgagt actaccaqaa gggcacggtg 240 
gagacgcgct tcgggtccaa gcaggagctc atcaacatga tcaacacggc gcacgcctac 300 
ggcatcaagg tcatcgcgga catcgtgatc aaccacaggg ccggcggcga cctggagtgg 360 
aacccgttcg tcggcgacta cacctggacg gacttctcca aggtcgcctc cggcaagtac 420 
accgccaact acctcgactt ccaccccaac gagctgcacg cgggcgactc cggcacgttc 480 
ggcggctacc cggacatctg ccacgacaag tcctgggacc agtactggct ctgggcctcg 540 
caggagtcct acgcggccta cctgcgctcc atcggcatcg acgcgtggcg cttcgactac 600 
gtcaagggct acggggcctg ggtggtcaag gactggctca actggtgggg cggctgggcg 660 
gtgggcgagt actgggacac caacgtcgac gcgctgctca actgggccta ctcctccggc 720 
gccaaggtgt tcgacttccc cctgtactac aagatggacg cggccttcga caacaagaac 780 
atcccggcgc tcgtcgaggc cctgaagaac ggcggcacgg tggtctcccg cgacccgttc 840 
aaggccgtga ccttcgtcgc caaccacgac acggacatca tctggaacaa gtacccggcg 900 
tacgccttca tcctcaccta cgagggccag cccacgatct tctaccgcga ctacgaggag 960 
tggctgaaca aggacaagct caagaacctg atctggattc acgacaacct cgcgggcggc 1020 
tccactagta tcgtgtacta cgactccgac gagatgatct tcgtccgcaa cggctacggc 1080 
tccaagcccg gcctgatcac gtacatcaac ctgggctcct ccaaggtggg ccgctgggtg 1140 
tacgtcccga agttcgccgg cgcgtgcatc cacgagtaca ccggcaacct cggcggctgg 1200 
gtggacaagt acgtgtactc ctccggctgg gtctacctgg aggccccggc ctacgacccc 1260 
gccaacggcc agtacggcta ctccgtgtgg tcctactgcg gcgtcggc 1308 

<210> 3 
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<211> 800 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic 



<400> 3 

Met Gly His Trp 
1 

Asp Asp Phe Gly 

20 

Lys Val Gly lie 
35 

Ala Lys Asp Arg 
50 

lie Leu Gin Gly 
65 

Pro Arg lie Phe 

Phe Leu Thr Asn 

100 

Thr Val Asp Gly 
115 

Pro Thr Asp lie 
130 

Ser Leu Lys Glu 
145 

Gly Tyr Lys Pro 

Tyr Tyr Asp Gly 

180 

Phe Arg Val Trp 
195 

Lys Asn Gly Glu 
210 

Lys Gly Asn Gly 
225 

Val Phe Tyr Leu 

Val Asp Pro Tyr 

260 

Val Val Asn Leu 
275 

Gly Pro Lys lie 
290 

He Ala Asp He 
305 

Leu Tyr Leu Gly 

Thr Thr Gly Leu 

340 

He Leu Pro Phe 
355 

Phe Glu Lys Tyr 
370 

Pro Glu Gly Arg 

385 



Tyr Lys His Gin 
5 

Lys Val Ala Val 

He Val Arg Leu 

40 

Phe lie Glu He 
55 

Val Glu Glu He 
70 

Phe Ala Gin Ala 
85 

Pro Val Asp Thr 

Lys Glu lie Pro 

120 

Asp Val Thr Asn 
135 

Glu Asp Leu Arg 
150 

Ala Arg Val lie 
165 

Glu Leu Gly Ala 

Ser Pro Val Ser 

200 

Asp Thr Glu Pro 
215 

Val Trp Glu Ala 

230 

Tyr Gin Leu Glu 
245 

Ser Lys Ala Val 

Ala Arg Thr Asn 

280 

Glu Gly Tyr Glu 
295 

Thr Gly Leu Glu 
310 

Leu Thr Glu Glu 
325 

Ser His Leu Val 

Phe Asp Phe Tyr 

360 

Tyr Asn Trp Gly 
375 

Tyr Ser Thr Asp 
390 



Arg Ala Tyr Gin 
10 

Val Lys Leu Pro 
25 

Asn Glu Trp Gin 

Lys Asp Gly Lys 

60 

Phe Tyr Glu Lys 
75 

Arg Ser Asn Lys 

90 

Lys Lys Lys Glu 
105 

Val Ser Arg Val 

Tyr Val Arg lie 

140 

Lys Asp Val Glu 
155 

Met Met Glu lie 
170 

Val Tyr Ser Pro 
185 

Lys Trp Val Lys 

Tyr Gin Val Val 

220 

Val Val Glu Gly 
235 

Asn Tyr Gly Lys 
250 

Tyr Ala Asn Asn 
265 

Pro Glu Gly Trp 

Asp Ala lie lie 

300 

Asn Ser Gly Val 
315 

Asn Thr Lys Gly 
330 

Glu Leu Gly Val 
345 

Thr Gly Asp Glu 

Tyr Asp Pro Tyr 

380 

Pro Lys Asn Pro 
395 



Phe Thr Gly Glu 
15 

Met Asp Leu Thr 

30 

Ala Lys Asp Val 
45 

Ala Glu Val Trp 

Pro Asp Thr Ser 

80 

Val lie Glu Ala 

95 

Leu Phe Lys Val 
110 

Glu Lys Ala Asp 
125 

Val Leu Ser Glu 

Leu He He Glu 

160 

Leu Asp Asp Tyr 
175 

Glu Lys Thr lie 
190 

Val Leu Leu Phe 
205 

Asn Met Glu Tyr 

Asp Leu Asp Gly 

240 

lie Arg Thr Thr 
255 

Gin Glu Ser Ala 
270 

Glu Asn Asp Arg 
285 

Tyr Glu lie His 

Lys Asn Lys Gly 

320 

Pro Gly Gly Val 
335 

Thr His Val His 

350 

Leu Asp Lys Asp 
365 

Leu Phe Met Val 

His Thr Arg lie 

400 
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Arg Glu Val Lys 

Val lie Met Asp 

420 

Ser Ala Phe Asp 
435 

Thr Gly Ala Tyr 
450 

Glu Arg Pro Met 
4 65 

Val Lys Glu Tyr 

lie Asp Lys Lys 

500 

Asp Pro Thr lie 
515 

Pro lie Arg Phe 
530 

Phe Asn Asp Glu 
545 

Ser Val Lys Gly 

Lys Arg Gly Val 

580 

Ser Phe Ala Leu 
595 

Asp Asn His Thr 
610 

Lys Lys Lys Glu 
625 

Ala Gly Ala He 

Gly Gin Asp Phe 

660 

Ala Pro He Ser 
675 

He Asp Val Phe 
690 

His Pro Ala Phe 

705 

Glu Phe Leu Pro 

His Ala Gly Gly 

740 

Asn Leu Glu Lys 
755 

Val Val Asn Ser 
770 

Gly Thr He Glu 
785 



<210> 4 
<2H> 2400 
<212> DNA 
<213> Artificial 

<220> 



Glu Met Val Lys 
405 

Met Val Phe Pro 

Gin Thr Val Pro 

440 

Leu Asn Glu Ser 
455 

Met Arg Lys Phe 
470 

His He Asp Gly 
485 

Thr Met Leu Glu 

lie Leu Tyr Gly 

520 

Gly Lys Ser Asp 
535 

Phe Arg Asp Ala 
550 

Phe Val Met Gly 
565 

Val Gly Ser He 

Asp Pro Glu Glu 

600 

Leu Trp Asp Lys 
615 

Trp Thr Glu Glu 
630 

Leu Leu Thr Ser 
645 

Cys Arg Thr Thr 

He Asn Gly Phe 

680 

Asn Tyr His Lys 
695 

Arg Leu Lys Asn 
710 

Gly Gly Arg Arg 
725 

Asp Pro Trp Lys 

Thr Thr Tyr Lys 

760 

Gin Lys Ala Gly 
775 

Leu Asp Pro Leu 
790 



Sequence 



Ala Leu His Lys 
4 10 

His Thr Tyr Gly 
425 

Tyr Tyr Phe Tyr 

Gly Cys Gly Asn 

460 

He Val Asp Thr 
475 

Phe Arg Phe Asp 
490 

Val Glu Arg Ala 
505 

Glu Pro Trp Gly 

Val Ala Gly Thr 

540 

He Arg Gly Ser 
555 

Gly Tyr Gly Lys 
570 

Asn Tyr Asp Gly 
585 

Thr He Asn Tyr 

Asn Tyr Leu Ala 

620 

Glu Leu Lys Asn 
635 

Gin Gly Val Pro 
650 

Asn Phe Asn Asp 
665 

Asp. Tyr Glu Arg 

Gly Leu He Lys 

700 

Ala Glu Glu He 
715 

He Val Ala Phe 
730 

Asp He Val Val 
745 

Leu Pro Glu Gly 

Thr Glu Val He 

780 

Ser Ala Tyr Val 
795 



Page 4 



His Gly He Gly 
4 15 

He Gly Glu Leu 
430 

Arg He Asp Lys 
445 

Val He Ala Ser 

Val Thr Tyr Trp 

480 

Gin Met Gly Leu 
495 

Leu His Lys He 
510 

Gly Trp Gly Ala 
525 

His Val Ala Ala 

Val Phe Asn Pro 

560 

Glu Thr Lys He 
575 

Lys Leu lie Lys 
590 

Ala Ala Cys His 
605 

Ala Lys Ala Asp 

Ala Gin Lys Leu 

640 

Phe Leu His Gly 

655 

Asn Ser Tyr Asn 
670 

Lys Leu Gin Phe 
685 

Leu Arg Lys Glu 

Lys Lys His Leu 

720 

Met Leu Lys Asp 
735 

lie Tyr Asn Gly 

750 

Lys Trp Asn Val 
765 

Glu Thr Val Glu 

Leu Tyr Arg Glu 

800 
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--:223> synthetic 
<400> 4 

atgggccact ggtacaagca ccagcgcgcc taccagttca ccggcgagga cgacttcggg 60 
aaggtggccg tggtgaagct cccgatggac ctcaccaagg tgggcatcat cgtgcgcctc 120 
aacgagtggc aggcgaagga cgtggccaag gaccgcttca tcgagatcaa ggacggcaag 180 
gccgaggtgt ggatactcca gggcgtggag gagatcttct acgagaagcc ggacacctcc 240 
ccgcgcatct tcttcgccca ggcccgctcc aacaaggtga tcgaggcctt cctcaccaac 300 
ccggtggaca ccaagaagaa ggagctgttc aaggtgaccg tcgacggcaa ggagatcccg 360 
gtgtcccgcg tggagaaggc cgacccgacc gacatcgacg tgaccaacta cgtgcgcatc 420 
gtgctctccg agtccctcaa ggaggaggac ctccgcaagg acgtggagct gatcatcgag 480 
ggctacaagc cggcccgcgt gatcatgatg gagatcctcg acgactacta ctacgacggc 540 
gagctggggg cggtgtactc cccggagaag accatcttcc gcgtgtggtc cccggtgtcc 600 
aagtgggtga aggtgctcct cttcaagaac ggcgaggaca ccgagccgta ccaggtggtg 660 
aacatggagt acaagggcaa cggcgtgtgg gaggccgtgg tggagggcga cctcgacggc 720 
qtgttctacc tctaccagct ggagaactac ggcaagatcc gcaccaccgt ggacccgtac 780 
tccaaggccg tgtacgccaa caaccaggag tctgcagtgg tgaacctcgc ccgcaccaac 840 
ccggagggct gggagaacga ccgcggcccg aagatcgagg gctacgagga cgccatcatc 900 
tacgagatcc acatcgccga catcaccggc ctggagaact ccggcgtgaa gaacaagggc 960 
ctctacctcg gcctcaccga ggagaacacc aaggccccgg gcggcgtgac caccggcctc 1020 
tcccacctcg tggagctggg cgtgacccac gtgcacatcc tcccgttctt cgacttctac 1080 
accggcgacg agctggacaa ggacttcgag aagtactaca actggggcta cgacccgtac 1140 
ctcttcatgg tgccggaggg ccgctactcc accqacccqa aqaacccgca cacccgaatt 1200 
cgcgaggtga aggagatggt gaaggccctc cacaagcacg gcatcggcgt gatcatggac 1260 
atggtgttcc cgcacaccta cggcatcggc gagctgtccg ccttcgacca gaccgtgccg 1320 
tactacttct accgcatcga caagaccggc gcctacctca acgagtccgg ctgcggcaac 1380 
gtgatcgcct ccgagcgccc gatgatgcgc aagttcatcg tggacaccgt gacctactgg 1440 
gtgaaggagt accacatcga cggcttccgc ttcgaccaga tgggcctcat cgacaagaag 1500 
accatgctgg aggtggagcg cgccctccac aagatcgacc cgaccatcat cctctacggc 1560 
gagccgtggg gcggctgggg ggccccgatc cgcttcggca agtccgacgt ggccggcacc 1620 
cacgtggccg ccttcaacga cgagttccgc gacgccatcc gcggctccgt gttcaacccg 1680 
tccgtgaagg gcttcgtgat gggcggctac ggcaaggaga ccaagatcaa gcgcggcgtg 1740 
gtgggctcca tcaactacga cggcaagctc atcaagtcct tcgccctcga cccggaggag 1800 
accatcaact acgccgcctg ccacgacaac cacaccctct gggacaagaa ctacctcgcc 1860 
gccaaggccg acaagaagaa ggagtggacc gaggaggagc tgaagaacgc ccagaagctc 1920 
gccggcgcca tcctcctcac tagtcagggc gtgccgttcc tccacggcgg ccaggacttc 1980 
tgccgcacca ccaacttcaa cgacaactcc tacaacgccc cgatctccat caacggcttc 2040 
gactacgagc gcaagctcca gttcatcgac gtgttcaact accacaaggg cctcatcaag 2100 
ctccgcaagg agcacccggc cttccgcctc aagaacgccg aggagatcaa gaagcacctg 2160 
gagttcctcc cgggcgggcg ccgcatcgtg gccttcatgc tcaaggacca cgccggcggc 2220 
gacccgtgga aggacatcgt ggtgatctac aacggcaacc tggagaagac cacctacaag 2280 
ctcccggagg gcaagtggaa cgtggtggtg aactcccaga aggccggcac cgaggtgatc 2340 
gagaccgtgg agggcaccat cgagctggac ccgctctccg cctacgtgct ctaccgcgag 2400 



<210> 5 
< 2 1 1 > 693 
<212> PRT 

<213> Sulfolobus solfataricus 

<400> 5 

Met Glu Thr lie Lys lie Tyr Glu Asn Lys Gly Val Tyr Lys Val Val 

15 10 15 

He Gly Glu Pro Phe Pro Pro He Glu Phe Pro Leu Glu Gin Lys He 

20 25 30 

Ser Ser Asn Lys Ser Leu Ser Glu Leu Gly Leu Thr lie Val Gin Gin 

35 40 45 

Gly Asn Lys Val He Val Glu Lys Ser Leu Asp Leu Lys Glu His He 
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50 

lie Gly Leu Gly 
65 

Tyr Val Met Tyr 

Pro Leu Tyr Val 

100 

Ala Thr Gly Tyr 
115 

Gly Leu Glu Glu 
130 

Val Glu Phe Tyr 
14 5 

Lys Tyr Thr Glu 

Phe Gly Tyr Met 

180 

Val Glu Leu Val 
195 

Val Phe Leu Asp 
210 

His Pro Tyr Arg 
225 

Lys Arg Asn Val 

Asp Gin Asn Tyr 

260 

lie Glu Ser Gly 
275 

Val Tyr Pro Asp 
290 

Leu lie Ser Glu 
305 

Met Asn Glu Pro 

Leu Ser Ser Leu 

340 

Phe Pro Asp Asn 
355 

Glu Lys Val Arg 

370 

Lys Gly Phe Arg 
385 

Ala Gly Tyr Ala 

Asn Thr Pro Ser 

420 

Gly Leu Ser He 
4 35 

Phe Gin Gly Arg 
450 

Val Lys Tyr Tyr 
465 

Lys Ala Thr Asp 

Tyr Lys Glu Lys 

500 

Pro Tyr lie Tyr 



55 

Glu Lys Ala Phe 
70 

Asn Val Asp Ala 
85 

Ser He Pro Leu 

Phe Phe Asn Ser 

120 

Tyr Asp Lys Val 
135 

Val He Glu Gly 
150 

Leu Thr Gly Lys 
165 

He Ser Arg Tyr 

Asp He Met Gin 

200 

He His Tyr Met 
215 

Phe Pro Glu Pro 
230 

Lys Leu He Thr 
245 

Ser Pro Phe Leu 

Glu Leu Phe Val 

280 

Phe Phe Arg Glu 

295 

Trp Leu Ser Gin 
310 

Thr Asp Phe Ser 

325 

Pro Val Gin Phe 

Val Val His Tyr 

360 

Asn Ala Tyr Pro 

375 

Thr Ser His Arg 
390 

Gly He Gin Arg 
405 

Trp Asp Asp Leu 

Ser Gly Val Pro 

440 

Asn Phe Ala Glu 
455 

Ala Leu Ala Leu 
470 

Gly lie Asp Thr 
485 

Val Lys Glu He 
Ser Leu Ala Leu 



60 

Glu Leu Asp Arg 

7 5 

Gly Ala Tyr Lys 
90 

Phe He Ser Val 
105 

Ala Ser Lys Val 

He Val Thr He 

140 

Pro Arg He Glu 
155 

Pro Phe Leu Pro 
170 

Ser Tyr Tyr Pro 
185 

Lys Glu Gly Phe 

Asp Ser Tyr Lys 

220 

Lys Lys Leu lie 

235 

lie Val Asp His 
250 

Ser Gly Met Gly 
265 

Gly Lys Met Trp 

Asp Thr Arg Glu 

300 

Gly Val Asp Gly 
315 

Arg Ala lie Glu 

, 330 
Arg Asp Asp Arg 
345 

Leu Arg Gly Lys 

Leu Tyr Glu Ala 

380 

Asn Glu lie Phe 
395 

Tyr Ala Phe lie 
410 

Lys Leu Gin Leu 
4 25 

Phe Val Gly Cys 

He Asp Asn Ser 

460 

Phe Phe Pro Phe 
475 

Glu Pro Val Phe 
490 

Val Glu Leu Arg 
505 

Glu Ala Ser Glu 



Lys Arg Lys Arg 

80 

Lys Tyr Gin Asp 
95 

Lys Asp Gly Val 
110 

lie Phe Asp Val 
125 

Pro Glu Asp Ser 

Asp Val Leu Glu 

160 

Pro Met Trp Ala 
175 

Gin Asp Lys Val 
190 

Arg Val Ala Gly 
205 

Leu Phe Thr Trp 

Asp Glu Leu His 

240 

Gly lie Arg Val 
255 

Lys Phe Cys Glu 
270 

Pro Gly Thr Thr 
285 

Trp Trp Ala Gly 

lie Trp Leu Asp 

320 

lie Arg Asp Val 
335 

Leu Val Thr Thr 
350 

Arg Val Lys His 
365 

Met Ala Thr Phe 

lie Leu Ser Arg 

400 

Trp Thr Gly Asp 
4 15 

Gin Leu Val Leu 
430 

Asp lie Gly Gly 
445 

Met Asp Leu Leu 

Tyr Arg Ser His 

480 

Leu Pro Asp Tyr 
495 

Tyr Lys Phe Leu 
510 

Lys Gly His Pro 
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515 










520 


Val 


He 


Arg 


Pro 


Leu 


Phe 


Tyr 


Glu 




530 








535 




Arg 


He 


Glu 


Asp 


Glu 


Tyr 


Met 


Val 


545 










550 






He 


Val 


Ser 


Lys 


Glu 
565 


Glu 


Ser 


Arg 


Tro 


Tyr 


Asn 


Tyr 
580 


Trp 


Asn 


Gly 


Glu 


Lys 


Ser 


Thr 


His 


Glu 


Leu 


Pro 


He 




595 










600 


Pro 


Leu 
610 


Glu 


Gly 


Asp 


Glu 


Leu 

615 


He 


Arg 


Tyr 


Asp 


Asn 


Ala 


Glu 


He 


Thr 


625 










630 






Ser 


Arg 


Glu 


He 


Tyr 


Val 


Ser 


Lys 








645 








Val 


Ser 


Lys 


He 

660 


He 


Val 


Asp 


Asp 


Thr 


Met 


Gin 
675 


Asn 


Thr 


Tyr 


Val 


Ala 
680 


Lys 


He 

690 


Asn 


Leu 


Glu 









525 



Phe 


Gin 


Asp 


Asp 
540 


Asp 


Asp 


Met 


Tyr 


Gly 


Lys 


Tyr 
555 


Leu 


Leu 


Tyr 


Ala 


Pro 

560 


Leu 


Val 
570 


Thr 


Leu 


Pro 


Arg 


Gly 
575 


Lys 


He 


He 


Asn 


Gly 


Lys 


Ser 


Val 


Val 


585 










590 






Tyr 


Leu 


Arg 


Glu 


Gly 

605 


Ser 


He 


He 


Val 


Tyr 


Gly 


Glu 
620 


Thr 


Ser 


Phe 


Lys 


Ser 


Ser 


Ser 
635 


Asn 


Glu 


He 


Lys 


Phe 
640 


Leu 


Thr 
650 


He 


Thr 


Ser 


Glu 


Lys 
655 


Pro 


Ser 


Lys 


Glu 


He 


Gin 


Val 


Glu 


Lys 


665 










670 






Lys 


He 


Asn 


Gin 


Lys 


He 


Arg 


Gly 



685 



<210> 6 
<211> 2082 
<212> DMA 

<213> Sulfolobus solfataricus 
<400> 6 

atggagacca tcaagatcta cgagaacaag ggcgtgtaca aggtggtgat cggcgagccg 60 
ttcccgccga tcgagttccc gctcgagcag aagatctcct ccaacaagtc cctctccgag 120 
ctgggcctca ccatcgtgca gcagggcaac aaggtgatcg tggagaagtc cctcgacctc 180 
aaggagcaca tcatcggcct cggcgagaag gccttcgagc tggaccgcaa gcgcaagcgc 240 
tacgtgatgt acaacgtgga cgccggcgcc tacaagaagt accaggaccc gctctacgtg 300 
tccatcccgc tcttcatctc cgtgaaggac ggcgtggcca ccggctactt cttcaactcc 360 
gcctccaagg tgatcttcga cgtgggcctc gaggagtacg acaaggtgat cgtgaccatc 420 
ccggaggact ccgtggagtt ctacgtgatc gagggcccgc gcatcgagga cgtgctcgag 480 
aagtacaccg agctgaccgg caagccgttc ctcccgccga tgtgggcctt cggctacatg 540 
atctcccgct actcctacta cccgcaggac aaggtggtgg agctggtgga catcatgcag 600 
aaggagggct tccgcgtggc cggcgtgttc ctcgacatcc actacatgga ctcctacaag 660 
ctcttcacct ggcacccgta ccgcttcccg gagccgaaga agctcatcga cgagctgcac 720 
aagcgcaacg tgaagctcat caccatcgtg gaccacggca tccgcgtgga ccagaactac 780 
tccccgttcc tctccggcat gggcaagttc tgcgagatcg agtccggcga gctgttcgtg 840 
ggcaagatgt ggccgggcac caccgtgtac ccggacttct tccgcgagga cacccgcgag 900 
tggtgggccg gcctcatctc cgagtggctc tcccagggcg tggacggcat ctggctcgac 960 
atgaacgagc cgaccgactt ctcccgcgcc atcgagatcc gcgacgtgct ctcctccctc 1020 
ccggtgcagt tccgcgacga ccgcctcgtg accaccttcc cggacaacgt ggtgcactac 1080 
ctccgcggca agcgcgtgaa gcacgagaag gtgcgcaacg cctacccgct ctacgaggcg 1140 
atggccacct tcaagggctt ccgcacctcc caccgcaacg agatcttcat cctctcccgc 1200 
gccggctacg ccggcatcca gcgctacgcc ttcatctgga ccggcgacaa caccccgtcc 1260 
tgggacgacc tcaagctcca gctccagctc gtgctcggcc tctccatctc cggcgtgccg 1320 
ttcgtgggct gcgacatcgg cggcttccag ggccgcaact tcgccgagat cgacaactcg 1380 
atggacctcc tcgtgaagta ctacgccctc gccctcttct tcccgttcta ccgctcccac 1440 
aaggccaccg acggcatcga caccgagccg gtgttcctcc cggactacta caaggagaag 1500 
gtgaaggaga tcgtggagct gcgctacaag ttcctcccgt acatctactc cctcgccctc 1560 
gaggcctccg agaagggcca cccggtgatc cgcccgctct tctacgagtt ccaggacgac 1620 
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gacgacatgt accgcatcga ggacgagtac atggtgggca agtacctcct ctacgccccg 1680 

atcgtgtcca aggaggagtc ccgcctcgtg accctcccgc gcggcaagtg gtacaactac 1740 

tggaacggcg agatcatcaa cggcaagtcc gtggtgaagt ccacccacga gctgccgatc 1800 

tacctccgcg agggctccat catcccqctc gagggcgacg agctgatcgt gtacggcgag 1860 

acctccttca agcgctacga caacgccgag atcacctcct cctccaacga gatcaagttc 1920 

tcccgcgaga tctacgtgtc caagctcacc atcacctccg agaagccggt gtccaagatc 1980 

atcgtggacg actccaagga gatccaggtg gagaagacca tgcagaacac ctacgtggcc 2040 

aagatcaacc agaagatccg cggcaagatc aacctcgagt ga 2082 

<210> 7 
<211> 1818 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic 

<400> 7 

atggcggctc tggccacgtc gcagctcgtc gcaacgcgcg ccggcctggg cgtcccggac 60 

gcgtccacgt tccgccgcgg cgccgcgcag ggcctgaggg gggcccgggc gtcggcggcg 120 

gcggacacgc tcagcatgcg gaccagcgcg cgcgcggcgc ccaggcacca gcaccagcag 180 

gcgcgccgcg gggccaggtt cccgtcgctc gtcgtgtgcg ccagcgccgg catgaacgtc 240 

gtcttcgtcg gcgccgagat ggcgccgtgg agcaagaccg gaggcctcgg cgacgtcctc 300 

ggcggcctgc cgccggccat ggccgcgaac gggcaccgtg tcatggtcgt ctctccccgc 360 

tacgaccagt acaaggacgc ctgggacacc agcgtcgtgt ccgagatcaa gatgggagac 420 

gggtacgaga cggtcaggtt cttccactgc tacaagcgcg gagtggaccg cgtgttcgtt 480 

gaccacccac tgttcctgga gagggtttgg ggaaagaccg aggagaagat ctacgggcct 540 

gtcgctggaa cggactacag ggacaaccag ctgcggttca gcctgctatg ccaggcagca 600 

cttgaagctc caaggatcct gagcctcaac aacaacccat acttctccgg accatacggg 660 

gaggacgtcg tgttcgtctg caacgactgg cacaccggcc ctctctcgtg ctacctcaag 720 

agcaactacc agtcccacgg catctacagg gacgcaaaga ccgctttctg catccacaac 780 

atctcctacc agggccggtt cgccttctcc gactacccgg agctgaacct ccccgagaga 840 

ttcaagtcgt ccttcgattt catcgacggc tacgagaagc ccgtggaagg ccggaagatc 900 

aactggatga aggccgggat cctcgaggcc gacagggtcc tcaccgtcag cccctactac 960 

gccgaggagc tcatctccgg catcgccagg ggctgcgagc tcgacaacat catgcgcctc 1020 

accggcatca ccggcatcgt caacggcatg gacgtcagcg agtgggaccc cagcagggac 1080 

aagtacatcg ccgtgaagta cgacgtgtcg acggccgtgg aggccaaggc gctgaacaag 1140 

gaggcgctgc aggcggaggt cgggctcccg gtggaccgga acatcccgct ggtggcgttc 1200 

atcggcaggc tggaagagca gaagggcccc gacgtcatgg cggccgccat cccgcagctc 1260 

atggagatgg tggaggacgt gcagatcgtt ctgctgggca cgggcaagaa gaagttcgag 1320 

cgcatgctca tgagcgccga ggagaagttc ccaggcaagg tgcgcgccgt ggtcaagttc 1380 

aacgcggcgc tggcgcacca catcatggcc ggcgccgacg tgctcgccgt caccagccgc 1440 

ttcgagccct gcggcctcat ccagctgcag gggatgcgat acggaacgcc ctgcgcctgc 1500 

gcgtccaccg gtggactcgt cgacaccatc atcgaaggca agaccgggtt ccacatgggc 1560 

cgcctcagcg tcgactgcaa cgtcgtggag ccggcggacg tcaagaaggt ggccaccacc 1620 

ttgcagcgcg ccatcaaggt ggtcggcacg ccggcgtacg aggagatggt gaggaactgc 1680 

atgatccagg atctctcctg gaagggccct gccaagaact gggagaacgt gctgctcagc 1740 

ctcggggtcg ccggcggcga gccaggggtt gaaggcgagg agatcgcgcc gctcgccaag 1800 

gagaacgtgg ccgcgccc 1818 

<210> 8 
<211> 606 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
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<400> 8 

Met Ala Ala Leu 
1 

Gly Val Pro Asp 

20 

Arg Gly Ala Arg 
35 

Ser Ala Arg Ala 

50 

Ala Arg Phe Pro 
65 

Val Phe Val Gly 

Gly Asp Val Leu 

100 

Arg Val Met Val 
115 

Asp Thr Ser Val 
130 

Val Arg Phe Phe 
145 

Asp His Pro Leu 

lie Tyr Gly Pro 

180 

Phe Ser Leu Leu 
195 

Leu Asn Asn Asn 
210 

Phe Val Cys Asn 
225 

Ser Asn Tyr Gin 

Cys lie His Asn 

260 

Pro Glu Leu Asn 
275 

Asp Gly Tyr Glu 

290 

Ala Gly lie Leu 
305 

Ala Glu Glu Leu 

lie Met Arg Leu 

340 

Ser Glu Trp Asp 
355 

Val Ser Thr Ala 

370 

Ala Glu Val Gly 
385 

lie Gly Arg Leu 

lie Pro Gin Leu 

420 

Gly Thr Gly Lys 
435 

Lys Phe Pro Gly 



Ala Thr Ser Gin 

5 

Ala Ser Thr Phe 

Ala Ser Ala Ala 

40 

Ala Pro Arg His 
55 

Ser Leu Val Val 

70 

Ala Glu Met Ala 

85 

Gly Gly Leu Pro 

Val Ser Pro Arg 

120 

Val Ser Glu lie 
135 

His Cys Tyr Lys 
150 

Phe Leu Glu Arg 
165 

Val Ala Gly Thr 

Cys Gin Ala Ala 

200 

Pro Tyr Phe Ser 
215 

Asp Trp His Thr 

230 

Ser His Gly lie 

245 

lie Ser Tyr Gin 

Leu Pro Glu Arg 

280 

Lys Pro Val Glu 
295 

Glu Ala Asp Arg 
310 

He Ser Gly He 
325 

Thr Gly He Thr 

Pro Ser Arg Asp 

360 

Val Glu Ala Lys 
375 

Leu Pro Val Asp 
390 

Glu Glu Gin Lys 
405 

Met Glu Met Val 

Lys Lys Phe Glu 

440 

Lys Val Arg Ala 



Leu Val Ala Thr 

10 

Arg Arg Gly Ala 

25 

Ala Asp Thr Leu 

Gin His Gin Gin 

60 

Cys Ala Ser Ala 

75 

Pro Trp Ser Lys 
90 

Pro Ala Met Ala 
105 

Tyr Asp Gin Tyr 

Lys Met Gly Asp 

140 

Arg Gly Val Asp 
155 

Val Trp Gly Lys 
170 

Asp Tyr Arg Asp 
185 

Leu Glu Ala Pro 

Gly Pro Tyr Gly 

220 

Gly Pro Leu Ser 
235 

Tyr Arg Asp Ala 

250 

Gly Arg Phe Ala 

265 

Phe Lys Ser Ser 

Gly Arg Lys He 

300 

Val Leu Thr Val 
315 

Ala Arg Gly Cys 
330 

Gly He Val Asn 
345 

Lys Tyr He Ala 

Ala Leu Asn Lys 

380 

Arg Asn He Pro 
395 

Gly Pro Asp Val 
410 

Glu Asp Val Gin 
425 

Arg Met Leu Met 
Val Val Lys Phe 



Arg Ala Gly Leu 
15 

Ala Gin Gly Leu 
30 

Ser Met Arg Thr 
45 

Ala Arg Arg Gly 

Gly Met Asn Val 

80 

Thr Gly Gly Leu 
95 

Ala Asn Gly His 
110 

Lys Asp Ala Trp 
125 

Gly Tyr Glu Thr 

Arg Val Phe Val 

160 

Thr Glu Glu Lys 
175 

Asn Gin Leu Arg 
190 

Arg lie Leu Ser 
205 

Glu Asp Val Val 

Cys Tyr Leu Lys 

240 

Lys Thr Ala Phe 
255 

Phe Ser Asp Tyr 

270 

Phe Asp Phe He 
285 

Asn Trp Met Lys 

Ser Pro Tyr Tyr 

320 

Glu Leu Asp Asn 
335 

Gly Met Asp Val 
350 

Val Lys Tyr Asp 
365 

Glu Ala Leu Gin 

Leu Val Ala Phe 

400 

Met Ala Ala Ala 
415 

He Val Leu Leu 
430 

Ser Ala Glu Glu 
445 

Asn Ala Ala Leu 
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4 50 










455 










460 










Ala 


His 


His 


lie 


Me t 


A "1 -i 

Ala 


biy 


Aia 


AS p 


va l 


Le u 


Aia 


v a i 


T 1 H v 

inr 




Ay-/-* 

nl y 


465 










470 










475 










480 


Pne 


Glu 


Pro 


Cys 


Gly 


Leu 


lie 


bin 


Leu 


LjI n 


Gly 


Met 


Ar g 


1 yr 


bi y 


inr 








485 










/ion 










H D 




Pro 


Cys 


Ala 


Cys 


A "1 

Ala 


jr*. V 

Ser 


inr 


biy 


Lj i y 


Leu 


v a i 


AS p 


i nr 


T 1 A 

lie 


T 1 o 
11c 


o ± u 






500 










DUD 










J 1U 






Gly 


Lys 


Thr 
515 


Gly 


Pne 


His 


Me t 


Gly 
520 


A n 

Ar g 


Leu 


O ^-v V- 

oe r 


v a i 


AS p 

525 


Lys 


A c n 

MS n 


Veil 


Va 1 


Glu 
530 


Pro 


Ala 


TV 

Asp 


vai 


Lys 
535 


ij y s 


v a i 


Mia 


i n r 


T K v 

i nr 
540 




oin 


Mr y 




lie 


Lys 


Va 1 


Va 1 


Gly 


Thr 


Pro 


Ala 


Tyr 


GlU 


GlU 


Met 


val 


A 

Ar g 


as n 


Lys 


545 






550 










555 










560 


Met 


He 


Gin 


Asp 


Leu 


Ser 


Trp 


Lys 


Gly 


Pro 


Ala 


Lys 


Asn 


Trp 


Glu 


Asn 








565 










570 










575 




Val 


Leu 


Leu 


Ser 
580 


Leu 


Gly 


Val 


Ala 


Gly 
585 


Gly 


Glu 


Pro 


Gly 


Val 
590 


Glu 


Gly 


Glu 


Glu 


He 
595 


Ala 


Pro 


Leu 


Ala 


Lys 
600 


Glu 


Asn 


Val 


Ala 


Ala 

605 


Pro 







<210> 9 
<211> 2223 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 9 

atggccaagt acctggagct ggaggagggc ggcgtgatca tgcaggcgtt ctactgggac 60 
gtcccgagcg gaggcatctg gtgggacacc atccgccaga agatccccga gtggtacgac 120 
gccggcatct ccgcgatctg gataccgcca gcttccaagg geatgtcegg gggctactcg 180 
atgggctacg acccgtacga ctacttcgac cteggegagt actaccagaa gggcacggtg 240 
gagacgeget tegggtccaa gcaggagctc atcaacatga tcaacacggc gcacgcctac 300 
ggcatcaagg teatcgegga categtgate aaccacaggg ccggcggcga cctggagtgg 360 
aacccgttcg teggegacta cacctggacg gacttctcca aggtcgcctc eggcaagtae 420 
accgccaact acctcgactt ccaccccaac gagctgeacg cgggcgactc cggcacgttc 480 
ggcggctacc eggacatctg ccacgacaag tcctgggacc agtactggct ctgggcctcg 540 
caggagtcct acgcggccta cctgcgctcc ateggcateg acgcgtggcg cttcgactac 600 
gtcaagggct aeggggectg ggtggtcaag gactggctca actggtgggg cggctgggcg 660 
gtgggcgagt actgggacac caacgtcgac gcgctgctca actgggecta ctcctccggc 720 
gccaaggtgt tcgacttccc cctgtactac aagatggacg cggccttcga caacaaqaac 780 
atcccggcgc tegtcgagge cctgaagaac ggeggcaegg tggtctcccg cgacccgttc 840 
aaggccgtga ccttcgtcgc caaccacgac aeggacatea tctggaacaa gtacccggcg 900 
tacgccttca tcctcaccta egagggecag cccacgatct tctaccgcga ctacgaggag 960 
tggctgaaca aggacaagct caagaacctg atctggattc acgacaacct cgcgggcggc 1020 
tccactagta tcgtgtacta cgactccgac gagatgatct tcgtccgcaa cggctacggc 1080 
tccaagcccg gcctgatcac gtacatcaac ctgggctcct ccaaggtggg ccgctgggtg 1140 
tacgtcccga agttcgccgg cgcgtgcatc cacgagtaca ccggcaacct eggeggctgg 1200 
gtggacaagt acgtgtactc ctccggctgg gtctacctgg aggccccggc ctacgacccc 1260 
gccaacggcc agtaeggcta ctccgtgtgg tcctactgcg gcgtcggcac ategattget 1320 
ggcatcctcg aggecgacag ggtcctcacc gtcagcccct actacgccga ggagctcatc 1380 
tccggcatcg ccaggggctg cgagctcgac aacatcatgc gcctcaccgg catcaccggc 1440 
ategtcaacg gcatggacgt cagcgagtgg gaccccagca gggacaagta catcgccgtg 1500 
aagtacgacg tgtcgaegge cgtggaggcc aaggcgctga acaaggaggc getgeaggeg 1560 
gaggteggge tcccggtgga ceggaaoate ccgctggtgg cgttcatcgg caggctggaa 1620 
gagcagaagg gccccgacgt catggcggcc gccatcccgc agctcatgga gatggtggag 1680 
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gacgtgcaga tcgttctgct gggcacgggc aagaagaagt tcgagcgcat gctcatgagc 1740 

gccgaggaga agttcccagg caaggtgcgc gccgtggtca agttcaacgc ggcgctggcg 1800 

caccacatca tggccggcgc cgacgtgctc gccgtcacca gccgcttcga gccctgcggc 1860 

ctcatccagc tgcaggggat gcgatacgga acgccctgcg cctgcgcgtc caccggtgga 1920 

ctcgtcgaca ccatcatcga aggcaagacc gggttccaca tgggccgcct cagcgtcgac 1980 

tgcaacgtcg tggagccggc ggacgtcaag aaggtggcca ccaccttgca gcgcgccatc 2040 

aaggtggtcg gcacgccggc gtacgaggag atggtgagga actgcatgat ccaggatctc 2100 

tcctggaagg gccctgccaa gaactgggag aacgtgctgc tcagcctcgg ggtcgccggc 2160 

ggcgagccag gggttgaagg cgaggagatc gcgccgctcg ccaaggagaa cgtggccgcg 2220 

ccc 2223 

<210> 10 
<211> 741 



12> PRT 



<213> Artificial Sequence 



<220> 

<223> synthetic 



<400> 10 




Met 

n 


TV 1 _ 

Al a 


T . . « 

Lys 


Tyr 


1 

Phe 


Tyr 


Trp 


Asp 








20 


Gin 


Lys 


lie 


Pro 










Pro 


Pro 


Ala 


Se r 




D U 






Pro 


Tyr 


AS p 


Tyr 


D D 








Glu 


Thr 


TV 

Arg 


Pn e 


Ala 


His 


Ala 


Tyr 








100 


Arg 


Ala 


Gly 


Gly 






115 




Trp 


Thr 


Asp 


Phe 




130 






Leu 


Asp 


Phe 


His 


14 5 








Gly 


Gly 


Tyr 


Pro 


Leu 


Trp 


Ala 


Ser 








180 


lie 


Asp 


Ala 


Trp 






195 




Val 


Lys 


Asp 


Trp 




210 






Trp 


Asp 


Thr 


Asn 


225 








Ala 


Lys 


Val 


Phe 


Asp 


Asn 


Lys 


Asn 








260 


Thr 


Val 


Val 


Ser 






275 




His 


Asp 


Thr 


Asp 




290 







Leu 


Glu 


Leu 


Glu 


b 








Va 1 


Pro 


C* V" 

b e r 


Gly 


Glu 


I rp 


Tyr 


ASp 








40 


Lys 


Gly 


Met 


Ser 






.J w> 




Pne 


ASp 


Leu 






i n 






Gly 


Ser 


Lys 


Gin 


85 








Gly 


He 


Lys 


Val 


Asp 


Leu 


Glu 


Trp 








120 


Ser 


Lys 


Val 


Ala 






135 




Pro 


Asn 


Glu 


Leu 




150 






Asp 


He 


Cys 


His 


165 








Gin 


Glu 


Ser 


Tyr 


Arg 


Phe 


Asp 


Tyr 








200 


Leu 


Asn 


Trp 


Trp 






215 




Val 


Asp 


Ala 


Leu 




230 






Asp 


Phe 


Pro 


Leu 


245 








He 


Pro 


Ala 


Leu 


Arg 


Asp 


Pro 


Phe 








280 


He 


lie 


Trp 


Asn 






295 





Glu Gly Gly Val 
10 

Gly He Trp Trp 
25 

Ala Gly Tie Ser 

Gly Gly Tyr Ser 

60 

Glu Tyr Tyr Gin 
75 

Glu Leu lie Asn 
90 

He Ala Asp lie 
105 

Asn . Pro Phe Val 

Ser Gly Lys Tyr 

140 

His Ala Gly Asp 
155 

Asp Lys Ser Trp 
170 

Ala Ala Tyr Leu 

185 

Val Lys Gly Tyr 

Gly Gly Trp Ala 

220 

Leu Asn Trp Ala 
235 

Tyr Tyr Lys Met 
250 

Val Glu Ala Leu 
265 

Lys Ala Val Thr 

Lys Tyr Pro Ala 

300 



He Met Gin Ala 
15 

Asp Thr lie Arg 
30 

Ala lie Trp lie 
4 5 

Met Gly Tyr Asp 

Lys Gly Thr Val 

80 

Met lie Asn Thr 

95 

Val He Asn His 
110 

Gly Asp Tyr Thr 
125 

Thr Ala Asn Tyr 

Ser Gly Thr Phe 

160 

Asp Gin Tyr Trp 
175 

Arg Ser He Gly 
190 

Gly Ala Trp Val 
205 

Val Gly Glu Tyr 

Tyr Ser Ser Gly 

240 

Asp Ala Ala Phe 
255 

Lys Asn Gly Gly 
270 

Phe Val Ala Asn 
285 

Tyr Ala Phe lie 
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Leu 


Thr 


Tyr 


Glu 


Gl y 


Gin 


Pro 


T 1 v- 

l n r 


305 










310 






Trp 


Leu 


Asn 


Lys 


Asp 


Lys 


Le u 


Lys 










32 5 








Leu 


Ala 


Gly 


Gly 


Ser 


Thr 


be r 


lie 








b 4 U 










I le 


Phe 


Val 


Arg 


Asn 


Gly 


Tyr 


biy 






*D C C 

3 b 5 










*3 ^n 


lie 


Asn 


Leu 


Gly 


Ser 


Ser 


Lys 


val 




37 0 










J / J 




Phe 


Ala 


Gly 


Ala 


Cys 


lie 


HIS 


Gl U 


385 










1 A A 

3 90 






Val 


Asp 


Lys 


Tyr 


Va 1 


Tyr 


f™* 

Ser 


be r 










/inc. 
4 U b 








Ala 


Tyr 


Asp 


Pro 


Ala 


As n 


Gly 


bin 








4 20 










Cys 


Gly 


Va 1 


Gly 


Thr 


Ser 


lie 


Ala 




4 b b 










a a n 


Leu 


Thr 


Val 


Ser 


Pro 


Tyr 


I yr 


Ala 




4 50 










/ICC. 

4 J J 




Arg 


Gly 


Cys 


Glu 


Leu 


Asp 


As n 


lie 


4 65 










4 7 0 






lie 


Val 


Asn 


Gly 


Met 


Asp 


val 


O V 

her 










/IOC 

4 8b 








Tyr 


lie 


TV T 

Ala 


Va 1 


Lys 


Tyr 


ASp 


v a l 








500 










Leu 


Asn 


Lys 


Glu 


T\ 1 ~ 

Ala 


Leu 


Gin 


Ala 






515 










con 


Asn 


I le 


Pro 


Leu 


Va l 


Ala 




lie 




530 










c, ^ c, 
D j j 




Pro 


Asp 


Va 1 


Met 


Ala 


a i a 


Aid 




r /l r 

5 4 5 








^ n 






Asp 


Val 


Gin 


lie 


val 


Leu 


LcU 


\j x y 








f, c, 
ODD 








Met 


Leu 


Met 


ber 


Ala 


Kj 1 U 


pi,, 
blli 


Ly5 








C O A 

boU 










Val 


Lys 


Phe 


Asn 


Ala 


Ala 


Leu 


Aia 




C Pi c 

b y b 










DUU 


Va 1 


Leu 


Ala 


Va 1 


I n r 


O y-^ -w- 

ber 


Arg 


D r-i a 

rne 




610 










DID 




Gin 


Gly 


Me t 


Arg 


Tyr 


<o i y 


i n r 


D y r\ 


62 5 










& -} n 






T . 

Leu 


Va 1 


ASp 


T" W v- 

i n r 


lie 


T 1 A 

lie 




{j± y 


















Leu 


Ser 


Va i 


ASp 


Lys 


7\ c i~v 

as n 


Vdl 


v a -L 








DoU 










Ala 


rr-» ) 

Thr 


Thr 


Leu 


Gin 


Arg 


Aia 


T 1 o 

lie 






67 5 










a q r\ 


Glu 


Glu 


Met 


Val 


Arg 


Asn 


Cys 


Met 




690 










695 




Pro 


Ala 


Lys 


Asn 


Trp 


Glu 


Asn 


Val 


705 










710 






Gly 


Glu 


Pro 


Gly 


Val 


Glu 


Gly 


Glu 








725 








Asn 


Val 


Ala 


Ala 


Pro 









740 



lie 


Pne 


Tyr 


Arg 


ASp 


Tyr- 


b 1 U 


pin 

b 1 U 






OLD 










^ w 


Asn 


Leu 


lie 


Trp 


T I Q 

lie 


MIS 


MS p 


A c n 
Moll 




j JU 










^ ? s 




Va l 


Tyr 


Tyr 


Asp 


C r-\ v 

o e r 


AS p 


Pill 
b 1 U 














J Ju 






O -w* 

be r 


Lys 


r"i y^** 

r r o 


biy 


i_»e u 


lie 


1(11 


i yr 










j u ^ 








Gly 


A *V J*** 

Arg 


Trp 


v a i 


1 yr 


\/a 1 


Prn 
r i u 


by o 








Ton 
J o u 










Tyr 


1 n r 


Gly 


as n 


Leu 


pi 

biy 


P 1 \/ 

biy 


i rp 




one 

jyj 










a no 


Gly 


Trp 


Va l 


Tyr 


Le u 


PI 
bl U 


Aia 


D v~ r\ 

tr r o 




4 1 U 










H i. D 




Tyr 


Gly 


Tyr 


be r 


va i 


Trp 


Cor 

o e r 


i yr 


/I O Q 










^ ^n 

H J v 






Gly 


T 1 n 

lie 


Leu 


b3 1 U 


Aia 






vdl 










A A R 








CjI u 


olU 


Leu 


11c 


C! o r" 


oi y 


T 1 o 

lie 


A 1 ^ 
nl a 








^ D *J 










Me t 


Arg 


Leu 


i n r 


biy 


T 1 Q 

lie 


1 111 


P 1 \/ 

biy 






4 7 s 

HID 










4 8 0 


Glu 


Trp 


ASp 


r ro 


C /=* »- 
OcL 




Mop 


Lyb 




/inn 

4 90 










4 y 0 




Se r 


i n r 


Ala 


vai 


pi,, 

bl'J 


A 1 o 

Aia 


Lys 


A 1 ^ 
Mia 


jU J 










sin 

J 1 u 






GlU 


va 1 


CjI y 


j_.e u 


O v 

tr r o 


\/ n 1 

v a i 


Ann 

MS p 


A y- /-t 

Mi g 










s ? s 

J i J 








biy 


A V /^I 

Ar g 


Leu 




m n 

ul u 


Pin 


by o 


P 1 V 
"o i y 








s 4 n 

J H \J 










r r o 


La x n 






Pin 




val 


P 1 u 






s s s 

J J J 










560 

\* v/ 


T 1 V—, v- 

i n r 


b i y 


T \ / c 

i-» y s 


by O 


T \ / o 

by o 


Php 

Lilt; 


Pin 


A r n 














57 5 




rne 


D v- /-\ 

r r o 


o ± y 


Lyb 


vdl 


L VJ 


A 1 ^ 
r\±. a 


vai 


C.QC 
JO J 










5 90 






HIS 


HIS 


T 1 0 

lie 




A I -i 
n 1 ca 


p 1 \/ 
uiy 


A 1 ^ 
Mia 


A c: o 










fins 

O V D 








bill 


D v" v-\ 


(~* \ t c 


o i y 


T oil 


T 1 p 
11c 


P 1 n 

Ol 1 1 


Toil 








U /C u 










cys 


Ala 

Ai. a 


by b 




C o K 
OCl 


liii 


P 1 V 


P 1 V 
o i y 




6 "3 S 
u j ~> 










64 0 


LyS 


i n r 


biy 




nio 


Mat- 


P 1 V 
uiy 


A r n 




^ ^ n 










655 

VJ ^ *J 




bill 


D v 

r r o 


Mia 




vdl 


T \ ; c: 


Ly j 


V^i 1 
vai 


a £. c, 
DO J 










o / u 






Lys 


1 I « 1 

Va 1 


Va l 


Gly 


T K v- 

l nr 


fro 


A 1 3 

Aia 


1 yr 








685 








lie 


Gin 


Asp 


Leu 


Ser 


Trp 


Lys 


Gly 








700 










Leu 


Leu 


Ser 


Leu 


Gly 


Val 


Ala 


Gly 






715 










720 


Glu 


He 


Ala 


Pro 


Leu 


Ala 


Lys 


Glu 




730 










735 
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60 



180 
240 
300 



<210> 11 
<211> 1515 
<212> DNA 
<213> Zea mays 

<400> 11 

agagagctat gagacgtatg tcctcaaagc cactttgcat tgtgtgaaac caatatcgat 

ctttgttact tcatcatgca tgaacatttg tggaaactac tagcttacaa gcattagtga 120 

cagctcagaa aaaagttatc tatgaaaggt ttcatgtgta ccgtgggaaa tgagaaatgt 

tgccaactca aacaccttca atatgttgtt tgcaggcaaa ctcttctgga agaaaggtgt 

ctaaaactat gaacgggtta cagaaaggta taaaccacgg ctgtgcattt tggaagtatc 

atctatagat gtctgttgag gggaaagccg tacgccaacg ttatttactc agaaacagct 360 

tcaacacaca gttgtctgct ttatgatggc atctccaccc aggcacccac catcacctat 420 

ctctcgtgcc tgtttatttt cttgcccttt ctgatcataa aaaaacatta agagtttgca 480 

aacatgcata ggcatatcaa tatgctcatt tattaatttg ctagcagatc atcttcctac 540 

tctttacttt atttattgtt tgaaaaatat gtcctgcacc tagggagctc gtatacagta 600 

ccaatgcatc ttcattaaat gtgaatttca gaaaggaagt aggaacctat gagagtattt 660 

ttcaaaatta attagcggct tctattatgt ttatagcaaa ggccaagggc aaaattggaa 720 

cactaatgat ggttggttgc atgagtctgt cgattacttg caagaaatgt gaacctttgt 780 

tcctgtgcgt gggcataaaa caaacagctt ctagcctctt ttacggtact tgcacttgca 840 

agaaatgtga actccttttc atttctgtat gtggacataa tgccaaagca tccaggcttt 900 

ttcatggttg ttgatgtctt tacacagttc atctccacca gtatgccctc ctcatactct 960 

atataaacac atcaacagca tcgcaattag ccacaagatc acttcgggag gcaagtgcga 1020 

tttcgatctc gcagccacct ttttttgttc tgttgtaagt ataccttccc ttaccatctt 1080 

tatctgttag tttaatttgt aattgggaag tattagtgga aagaggatga gatgctatca 1140 

tctatgtact ctgcaaatgc atctgacgtt atatgggctg cttcatataa tttgaattgc 1200 

tccattcttg ccgacaatat attgcaaggt atatgcctag ttccatcaaa agttctgttt 1260 

tttcattcta aaagcatttt agtggcacac aatttttgtc catgagggaa aggaaatctg 1320 

ttttggttac tttgcttgag .gtgcattctt catatgtcca gttttatgga agtaataaac 1380 

ttcagtttgg tcataagatg tcatattaaa gggcaaacat atattcaatg ttcaattcat 1440 

cgtaaatgtt ccctttttgt aaaagattgc atactcattt atttgagttg caggtgtatc 1500 
cagtagttgg aggag 1515 

<210> 12 
<211> 673 
<212> DNA 
<213> Zea mays 



<400> 12 

gatcatccag gtgcaaccgt ataagtccta aagtggtgag gaacacgaaa caaccatgca 60 

ttggcatgta aagctccaag aatttgttgt atccttaaca actcacagaa catcaaccaa 120 

aattgcacgt caagggtatt gggtaagaaa caatcaaaca aatcctctct gtgtgcaaag 180 

aaacacggtg agtcatgccg agatcatact catctgatat acatgcttac agctcacaag 240 

acattacaaa caactcatat tgcattacaa agatcgt ttc atgaaaaata aaataggccg 300 

gacaggacaa aaatccttga cgtgtaaagt aaatttacaa caaaaaaaaa gccatatgtc 360 

aagctaaatc taat tcgttt tacgtagatc aacaacctgt agaaggcaac aaaactgagc 420 

cacgcagaag tacagaatga ttccagatga accatcgacg tgctacgtaa agagagtgac 480 

gagtcatata catttggcaa gaaaccatga agctgcctac agccgtctcg gtggcataag 540 

aacacaagaa attgtgttaa ttaatcaaag ctataaataa cgctcgcatg cctgtgcact 600 

tctccatcac caccactggg tcttcagacc attagcttta tctactccag agcgcagaag 660 

aacccgatcg aca 673 

<210> 13 
<21 1 > 4 54 
<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> synthetic 

<400> 13 
Met Arg Val Leu 
1 

Ala Thr Ser Ala 

20 

Gin Ala Phe Tyr 
35 

He Arg Gin Lys 
50 

Trp He Pro Pro 
65 

Tyr Asp Pro Tyr 

Thr Val Glu Thr 

100 

Asn Thr Ala His 
115 

Asn His Arg Ala 
130 

Tyr Thr Trp Thr 
145 

Asn Tyr Leu Asp 

Thr Phe Gly Gly 

180 

Tyr Trp Leu Trp 
195 

He Gly He Asp 
210 

Trp Val Val Lys 

225 

Glu Tyr Trp Asp 

Ser Gly Ala Lys 

260 

Ala Phe Asp Asn 
275 

Gly Gly Thr Val 
290 

Ala Asn His Asp 
305 

Phe He Leu Thr 

Glu Glu Trp Leu 

340 

Asp Asn Leu Ala 
355 

Glu Met He Phe 
370 

Thr Tyr He Asn 
385 

Pro Lys Phe Ala 

Gly Trp Val Asp 

420 

Ala Pro Ala Tyr 



Leu Val Ala Leu 
5 

Lys Tyr Leu Glu 

Trp Asp Val Pro 

40 

lie Pro Glu Trp 
55 

Ala Ser Lys Gly 

70 

Asp Tyr Phe Asp 
85 

Arg Phe Gly Ser 

Ala Tyr Gly He 

120 

Gly Gly Asp Leu 
135 

Asp Phe Ser Lys 
150 

Phe His Pro Asn 
165 

Tyr Pro Asp He 

Ala Ser Gin Glu 

200 

Ala Trp Arg Phe 
215 

Asp Trp Leu Asn 

230 

Thr Asn Val Asp 
245 

Val Phe Asp Phe 

Lys Asn He Pro 

280 

Val Ser Arg Asp 
295 

Thr Asp He lie 
310 

Tyr Glu Gly Gin 
325 

Asn Lys Asp Lys 

Gly Gly Ser Thr 

360 

Val Arg Asn Gly 
375 

Leu Gly Ser Ser 
390 

Gly Ala Cys He 
405 

Lys Tyr Val Tyr 
Asp Pro Ala Asn 



Ala Leu Leu Ala 
10 

Leu Glu Glu Gly 
25 

Ser Gly Gly lie 

Tyr Asp Ala Gly 

60 

Met Ser Gly Gly 
75 

Leu Gly Glu Tyr 
90 

Lys Gin Glu Leu 
105 

Lys Val lie Ala 

Glu Trp Asn Pro 

140 

Val Ala Ser Gly 
155 

Glu Leu His Ala 
170 

Cys His Asp Lys 
185 

Ser Tyr Ala Ala 

Asp Tyr Val Lys 

220 

Trp Trp Gly Gly 
235 

Ala Leu Leu Asn 
250 

Pro Leu Tyr Tyr 

265 

Ala Leu Val Glu 

Pro Phe Lys Ala 

300 

Trp Asn Lys Tyr 
315 

Pro Thr lie Phe 
330 

Leu Lys Asn Leu 
345 

Ser lie Val Tyr 

Tyr Gly Ser Lys 

380 

Lys Val Gly Arg 
395 

His Glu Tyr Thr 
410 

Ser Ser Gly Trp 
425 

Gly Gin Tyr Gly 



Leu Ala Ala Ser 
15 

Gly Val lie Met 

30 

Trp Trp Asp Thr 
45 

lie Ser Ala lie 

Tyr Ser Met Gly 

80 

Tyr Gin Lys Gly 
95 

lie Asn Met He 
110 

Asp lie Val lie 
125 

Phe Val Gly Asp 

Lys Tyr Thr Ala 

160 

Gly Asp Ser Gly 
175 

Ser Trp Asp Gin 
190 

Tyr Leu Arg Ser 

205 

Gly Tyr Gly Ala 

Trp Ala Val Gly 

240 

Trp Ala Tyr Ser 
255 

Lys Met Asp Ala 
270 

Ala Leu Lys Asn 
285 

Val Thr Phe Val 

Pro Ala Tyr Ala 

320 

Tyr Arg Asp Tyr 
335 

lie Trp lie His 
350 

Tyr Asp Ser Asp 
365 

Pro Gly Leu lie 

Trp Val Tyr Val 

400 

Gly Asn Leu Gly 
415 

Val Tyr Leu Glu 
430 

Tyr Ser Val Trp 
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435 440 445 

Ser Tyr Cys Gly Val Gly 
450 



<210> 14 
<211> 460 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 14 



Met 


Ar g 


Val 


T 

Leu 


Leu 


Va 1 


TV "1 «» 

Ala 


Leu 


Al a 


T ^ L 1 

Le u 


Le u 


Ala 


Le u 


A 1 -i 

Ala 


Ala 


be r 


1 






5 










10 










15 




n 1 — 

Ala 


Tn r 


Ser 


A 1 — 

Ala 


Lys 


Tyr 


Le u 


blU 


Le u 


(j 1 u 


Lj 1 U 


p i \ i 
Gly 


oiy 


v a i 


T 1 o 

lie 


Mo t- 








i U 








9 <A 
















Gin 


TV ~\ 

Ala 


Phe 
35 


Tyr 


Trp 


ASp 


val 


D V 

riO 

40 


o e r 


o i y 


biy 


T 1 o 

lie 


l rp 
45 


i rp 


nop 


1 I 1 X 


lie 


Arg 
D U 


Gin 


Lys 


lie 


r r o 


Pin 

U 1 u 


l rp 


1 yr 


Ren 
nSp 


Mia 


p 1 \/ 
\j \j 


T 1 o 
J. X tr 


^ o y 


A 1 3 

nld 


T 1 o 
lie 


Trp 


lie 


Pro 


D y-» 

pro 


ai a 


C r\ v 

o e r 


L«y s 


p l v/ 
\j i y 






P 1 \r 


p l \/ 
uiy 


i yr 


Q o r* 
O tr L 


L Jfcr L 


PI V 

oi y 


b 0 










*7 n 










7 S 










80 


Tyr 


Asp 


r*> v 

Pro 


Tyr 


ASp 


Tyr 


ir n e 


A O rA 
*- 


ije u 


o i y 


Pin 
blU 


i yr 


T w y 

i yr 


P 1 n 

Oil! 




P 1 V 

x y 








ft a 










qn 










95 




Thr 


Val 


Glu 


i n r 


A v /~y 

Arg 


D K a. 

r ne 


vj i y 


o e r 


T \ r a 

LyS 


Pin 

o i n 


Pin 


T oil 
Lj tr U 


T 1 o 
x x c; 


Aon 
no 1 1 


Mo 1 - 


T 1 p 

lie 








100 








105 










110 






as n 


HP K -w 

i n r 


a i a 

1 1 R 




Mia 


i yr 


ij i y 


T 1 o 

12 0 




v a J. 


T 1 p 


A 1 A 


A q o 

12 5 


T 1 p 


Va 1 

vox 


lie 

X X c 


as n 


n 1 S 

i 7 n 


A y- 

Arg 


ft. 1 - 

Mia 


P 1 w 

Lj i y 


p i \/ 


Aon 
nop 

X «J 


T oil 
Lj c U 


Pin 


1 11 P 


flcn 
/i O 1 1 


Prn 

t X \J 

14 0 


Php 

IT 1 1 ~ 


Va 1 

v a x 


G 1 v 

o x y 


Asn 


Tyr 


T K v- 

i n r 


l rp 


Thy 


nop 


tr i 1 tr 


C v- 


T w c 
ijy o 


v d i 


A 1 a 


<ap r 


G 1 v/ 

o x y 


T , \/ c: 
Li y D 


T v r 


Thr 


Ala 


14 5 










150 










155 










160 


As n 


Tyr 


Leu 


ASp 


r ne 


n 1 S 


D T" /A 

r r o 


Moll 


Pin 
o 1 U 


T oil 


n x o 


A 1 A 
n 1 d 


P 1 V/ 


Z\ c rA 
nop 


^p r 


P 1 V 

o x y 








IDJ 










i 7 n 










17 5 




Thr 


Phe 


Gly 


Gly 
180 


i yr 


T""} V 

Pro 


ASp 


lie 


Cys 
185 


n 1 S 


ASp 


Li y s 




T 1 y 5A 

i rp 
190 




P 1 n 
bill 


Tyr 


Trp 


Leu 


Trp 


Ala 


Ser 


Gin 


Glu 


Ser 


Tyr 


Ala 


Ala 


Tyr 


Leu 


Arg 


Ser 


195 










200 










205 








lie 


Gly 
210 


He 


Asp 


Ala 


Trp 


Arg 
215 


Phe 


Asp 


Tyr 


Val 


Lys 
220 


Gly 


Tyr 


Gly 


Ala 


Trp 


Val 


Val 


Lys 


Asp 


Trp 


Leu 


Asn 


Trp 


Trp 


Gly 


Gly 


Trp 


Ala 


Val 


Gly 


225 










230 










235 










240 


Glu 


Tyr 


Trp 


Asp 


Thr 


Asn 


Val 


Asp 


Ala 


Leu 


Leu 


Asn 


Trp 


Ala 


Tyr 


Ser 






24 5 










250 










255 




Ser 


Gly 


Ala 


Lys 


Val 


Phe 


Asp 


Phe 


Pro 


Leu 


Tyr 


Tyr 


Lys 


Met 


Asp 


Ala 






260 










265 










270 






Ala 


Phe 


Asp 
275 


Asn 


Lys 


Asn 


lie 


Pro 
280 


Ala 


Leu 


Val 


Glu 


Ala 
285 


Leu 


Lys 


Asn 


Gly 


Gly 


Thr 


Val 


Val 


Ser 


Arg 


Asp 


Pro 


Phe 


Lys 


Ala 


Val 


Thr 


Phe 


Val 


290 










295 










300 










Ala 


Asn 


His 


Asp 


Thr 


Asp 


He 


lie 


Trp 


Asn 


Lys 


Tyr 


Pro 


Ala 


Tyr 


Ala 


305 










310 










315 










320 


Phe 


lie 


Leu 


Thr 


Tyr 
325 


Glu 


Gly 


Gin 


Pro 


Thr 
330 


lie 


Phe 


Tyr 


Arg 


Asp 
335 


Tyr 


Glu 


Glu 


Trp 


Leu 


Asn 


Lys 


Asp 


Lys 


Leu 


Lys 


Asn 


Leu 


He 


Trp 


He 


His 






340 










345 










350 
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Asp Asn Leu Ala 
355 

Glu Met lie Phe 

370 

Thr Tyr lie Asn 
385 

Pro Lys Phe Ala 

Gly Trp Val Asp 

420 

Ala Pro Ala Tyr 
4 35 

Ser Tyr Cys Gly 
450 



Gly Gly Ser Thr 

360 

Val Arg Asn Gly 

375 

Leu Gly Ser Ser 
390 

Gly Ala Cys He 
405 

Lys Tyr Val Tyr 

Asp Pro Ala Asn 

440 

Val Gly Ser Glu 
455 



Ser He Val Tyr 

Tyr Gly Ser Lys 

380 

Lys Val Gly Arg 
395 

His Glu Tyr Thr 
4 10 

Ser Ser Gly Trp 
425 

Gly Gin Tyr Gly 

Lys Asp Glu Leu 

460 



Tyr Asp Ser Asp 
365 I 

Pro Gly Leu He 

Trp Val Tyr Val 

400 

Gly Asn Leu Gly 
415 

Val Tyr Leu Glu 
430 

Tyr Ser Val Trp 
445 



<210> 15 

<211> 518 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic 



<400> 15 




Mat- 

1 


T QI1 

-be? U 




Ala 


Leu 


Gly 


Val 


Pro 








20 


Leu 


Arg 


Gly 


Ala 






35 




Thr 


Ser 


Ala 


Arg 




50 






Gly 


Ala 


Arg 


Phe 


65 








Ala 


Lys 


Tyr 


Leu 


Tyr 


Trp 


Asp 


Val 








100 


Lys 


He 


Pro 


Glu 




115 




Pro 


Ala 


Ser 


Lys 




130 






Tyr 


Asp 


Tyr 


Phe 


145 








Thr 


Arg 


Phe 


Gly 


His 


Ala 


Tyr 


Gly 








180 


Ala 


Gly 


Gly 


Asp 






195 




Thr 


Asp 


Phe 


Ser 




210 






Asp 


Phe 


His 


Pro 


225 








Gly 


Tyr 


Pro 


Asp 


Trp 


Ala 


Ser 


Gin 



Leu 


Ala 


Thr 


Ser 


5 








Asp 


Ala 


Ser 


Thr 


Arg 


Ala 


Ser 


Ala 








40 


Ala 


Ala 


Pro 


Arg 






55 




Pro 


Ser 


Leu 


Val 




70 






Glu 


Leu 


Glu 


Glu 


85 








Pro 


Ser 


Gly 


Gly 


Trp 


Tyr 


Asp 


Ala 








120 


Gly 


Met 


Ser 


Gly 






135 




Asp 


Leu 


Gly 


Glu 




150 






Ser 


Lys 


Gin 


Glu 


1 65 








He 


Lys 


Val 


He 


Leu 


Glu 


Trp 


Asn 








200 


Lys 


Val 


Ala 


Ser 




215 




Asn 


Glu 


Leu 


His 




230 






He 


Cys 


His 


Asp 



245 

Glu Ser Tyr Ala 



Gin Leu Val Ala 
10 

Phe Arg Arg Gly 
25 

Ala Ala Asp Thr 

His Gin His Gin 

60 

Val Cys Ala Ser 
75 

Gly Gly Val He 
90 

He Trp Trp Asp 
105 

Gly He Ser Ala 

Gly Tyr Ser Met 

140 

Tyr Tyr Gin Lys 
155 

Leu He Asn Met 
170 

Ala Asp He Val 
185 

Pro Phe Val Gly 

Gly Lys Tyr Thr 

220 

Ala Gly Asp Ser 
235 

Lys Ser Trp Asp 
250 

Ala Tyr Leu Arg 



Thr Arg Ala Gly 

15 

Ala Ala Gin Gly 
30 

Leu Ser Met Arg 
45 

Gin Ala Arg Arg 

Ala Gly Ala Met 

80 

Met Gin Ala Phe 
95 

Thr lie Arg Gin 
110 

lie Trp lie Pro 
125 

Gly Tyr Asp Pro 

Gly Thr Val Glu 

160 

He Asn Thr Ala 
175 

lie Asn His Arg 
190 

Asp Tyr Thr Trp 
205 

Ala Asn Tyr Leu 

Gly Thr Phe Gly 

240 

Gin Tyr Trp Leu 

255 

Ser lie Gly lie 
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260 










265 










270 






nSp 


A 1 - 

A i a 


I rp 


A y n 

m i g 


tr 1 1 c: 


Mop 


T v r 


Va 1 

V U 1 


I , V/ s 


Gl v 


T vr 
j 


Gl v 


Ala 


Trp 


Val 


Val 














280 










285 








F \ t d 

tys 


Mop 


i rp 




A q n 
noil 


1 x p 


T r o 


Gl v 


Gl v 


Trn 


Ala 


Val 


Gl V 


Glu 


Tyr 


Trp 












2 95 










300 










MS p 


Ti-i r- 
1 1 1 1 


A c n 

ms n 


V cl J. 




/n J. a 


T .P I ) 


I,pn 

J_J V- \_J 


Asn 


Tro 


Ala 


T vr 


Ser 


Ser 


Gly 


Ala 


305 








310 










315 










320 


j_«y s 


v a i 


riie 


Mo p 


rile 


P r r~> 


T P M 


T v r 


Twr 
i y b 


T .v s 


Me t 


Asd 


Ala 


Ala 


Phe 


Asp 






J) eL -J 










330 

—J -J 










335 




7\ y~. 

as n 


Lys 


as n 


lie 


ID y r\ 

r r O 


Ala 
Mia 


T oil 

Li t: U 


v a J. 




A 1 s 

i i. X CI 


T ,P 15 

l—i C UJ 




Asn 

i 1 w J 1 


Gl v 


Gl v 


Thr 






^ d n 










34 5 










350 






v a i 


va i 


o e r 


7\ V yT 

Arg 


A o r\ 
M S p 


D r* /^i 
tr L D 


tilt: 


by o 


Ala 


val 


Th r 
iiii 


Phe 


Val 


Ala 


Asn 


His 






355 








360 
~> \j \j 










365 








AS p 


J nr 


A S p 


T 1 o 


T 1 e 

lit; 


Trn 
i rp 


Ben 


i_> y o 


T v r 
i y x. 


Pro 


Ala 


T vr 


Ala 


Phe 


lie 


Le u 


370 










375 










380 










i nr 


l y r 


O b U 


*o ± y 


U J. 1 1 


P r n 




Tie 


Phe 


T v r 


Ara 


Asd 


Tvr 


Glu 


Glu 


Trp 


_} O _/ 






390 










395 










400 




A c n 

Mo n 


by a 




!_; y o 


T p 1 1 

l_i CT l_J 


i~j y o 


As n 


Tip u 


lie 

-L_ ^_ ~w 


Tr D 


lie 


His 


Asp 


Asn 


Leu 








4 0 5 

T V w 1 










4 10 










4 15 




Ala 




ui y 


Cor 
O C b 


Thr 




T 1 p 


Va 1 

vox 


T v r 


T vr 


Asd 


Ser 


Asd 


Glu 


Met 


lie 




4 ? 0 










4 2 5 










430 






r n e 


v a i 


Arg 

J *1 c. 


Aon 

ms n 


b i y 


T w r 

i yr 


v j. y 


Qpr 

J C 1 

4 4 0 


Two 
j_i y o 


P ro 


Gl v 


Leu 


lie 
4 4 5 


Thr 


Tyr 


lie 


Moll 


J_i U 


ui y 




^ p r 


T , v s 

i_ i y o 


Va 1 

V w* -1- 


Gl v 

y 


Ara 


Tr d 


Val 


Tvr 


Val 


Pro 


Lys 


Phe 




450 








455 










460 










Aia 


Giy 


A 1 -a 

Mia 




1 b t; 


n b o 




T v r 


Thr 


Gl v 

w x y 


As n 


Lpn 


Gl v 


Gl v 


Tro 


Va 1 


4 65 






470 










475 










480 


Asp 


Lys 


Tyr 


Val 


Tyr 


Ser 


Ser 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 


Ala 


Pro 


Ala 




435 










490 










495 




Tyr 


Asp 


Pro 


Ala 


Asn 


Gly 


Gin 


Tyr 


Gly 


Tyr 


Ser 


Val 


Trp 


Ser 


Tyr 


Cys 




500 










505 










510 






Gly 


Val 


Gly 
515 


Thr 


Ser 


lie 























<210> 16 
<211> 820 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> synthetic 
<400> 16 



Met 


Leu 


Ala 


Ala 


Leu 


Ala 


Thr 


Ser 


Gin 


Leu 


Val 


Ala 


Thr 


Arg 


Ala 


Gly 


1 








5 










10 










15 




Leu 


Gly 


Val 


Pro 


Asp 


Ala 


Ser 


Thr 


Phe 


Arg 


Arg 


Gly 


Ala 


Ala 


Gin 


Gly 






20 










25 










30 






Leu 


Arg 


Gly 


Ala 


Arg 


Ala 


Ser 


Ala 


Ala 


Ala 


Asp 


Thr 


Leu 


Ser 


Met 


Arg 




35 










40 










45 








Thr 


Ser 


Ala 


Arg 


Ala 


Ala 


Pro 


Arg 


His 


Gin 


His 


Gin 


Gin 


Ala 


Arg 


Arg 




50 








55 










60 










Gly 


Ala 


Arg 


Phe 


Pro 


Ser 


Leu 


Val 


Val 


Cys 


Ala 


Ser 


Ala 


Gly 


Ala 


Met 


65 








70 










75 










80 


Ala 


Lys 


Tyr 


Leu 


Glu 


Leu 


Glu 


Glu 


Gly 


Gly 


Val 


He 


Met 


Gin 


Ala 


Phe 






85 










90 










95 




Tyr 


Trp 


Asp 


Val 


Pro 


Ser 


Gly 


Gly 


He 


Trp 


Trp 


Asp 


Thr 


He 


Arg 


Gin 


100 










105 










110 
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Lys 


He 


Pro 


Glu 






lib 




Pro 


Ala 


Se r 


Lys 




1 jU 






Tyr 


AS p 


Tyr 


rne 


14 3 








i nr 


A v- /~i 

Mi y 


rne 


lj ± y 


MIS 


Mid 


i y r 


oi y 








180 


Ala 


Gly 


Gly 


Asp 






1 !? J 




I n r 


As p 


rne 


o e r 




/. 1 u 






Asp 


Pne 


HIS 


r r o 


/ Z D 








biy 


Tyr 


r r o 


AS p 


1 rp 


a i a 


JCI 


bill 








260 


Asp 


Ala 


Trp 


Arg 






Z / 0 




Lys 


Asp 


Trp 


Leu 




z y u 






Asp 


Thr 


Asn 


v a i 


jU j 








Lys 


val 


Pne 


Asp 


7\ r~s r~\ 


Ly b 


A C r~\ 
A S I ) 


lit; 








340 


Val 


Val 


Ser 


Arg 






3 SS 




ASp 


I nr 


Mop 


T 1 a 
1 x o 




7 7 0 

-J ' w 






i n r 


1 yr 


o 1 U 




ODD 








Leu 


A <r> r-> 

as n 


Lys 


Mop 


A 1 =. 

Ala 


o l y 




O c 1 








420 


Phe 


Val 


Arg 


Asn 






4 35 




as n 




biy 


D c x 




4 SO 

H _> u 






A 1 -i 

Aia 


r* l \/ 
oiy 


Aid 


C \) q 
^yj 


HOD 








Asp 


LyS 


i yr 


V d X 


l yr 


AS p 


D Y~ /~\ 

riO 


A "1 ■= 
A -L d 








son 


Gly 


Val 


Gly 


Thr 






515 




Thr 


Val 


Ser 


Pro 




530 






Gly 


Cys 


Glu 


Leu 


545 








Val 


Asn 


Gly 


Met 



i rp 


T v r 


Asd 


Ala 








120 


Gly 


Met 


Ser 


Gly 






135 




nop 


i-iC u 


o x y 


Glu 




150 

_L w*' \J 






*5 p r 


Lvs 

j — > y %j 


Gin 


Glu 


165 








lie 


Lvs 


Val 


He 


I.pl] 


Glu 

w -1- Vol 


Tro 


Asn 








200 


Lys 


Val 


Ala 


Ser 






215 




AS 1 1 


r; 1 1 1 

o ± u 




His 




2 30 

C— \J 






T 1 p 

IXC 


f v<; 
j 


His 

1 1 1 -J 


Aso 


2 4 5 








Glu 


Spr 


T vr 


Ala 


Phe 


Asd 


T vr 


Val 








280 


Asn 


Trp 


Trp 


Gly 






2 95 




A c; r-i 
Mop 




| ,P1 1 


T.pn 

Xj \^ vJ 




310 

^ X \J 






r l ic 


Pro 


1 I } 

i_J \Ji 


T vr 

y 


3 2 S 

O C ~J 








Pro 


Ala 


Leu 


Val 


As D 


Pro 


Phe 


Lys 








360 


He 


Trp 


Asn 


Lys 






375 




Gin 


Pro 


Thr 

X 1 1 X. 


I le 




390 








Leu 


Lvs 


Asn 


4 0 5 








Thr 

1114- 


Ser 


lie 


Val 


Gl v 


T vr 


Gl V 


Ser 








440 


Ser 


Lys 


Val 


Gly 






4 55 




I le 


His 


Glu 


Tvr 




470 






T v r 


Ser 


Ser 


Gly 


4 85 








ri O l J 




Gin 


Tvr 


Ser 


lie 


Ala 


Gly 








520 


Tyr 


Tyr 


Ala 


Glu 






535 




Asp 


Asn 


He 


Met 




550 






Asp 


Val 


Ser 


Glu 



565 



Gl v 


He 

JL X C 


Ser 


Ala 


Gl v 

\j x y 


Tvr 
i y x 


Ser 


Met 








140 


Tyr 


Tyr 


Gin 


Lys 






155 




Leu 

XJ 


He 

X 1 


Asn 


Met 




170 






Al a 

l 1 mi- ■ * 


Asd 


He 


Val 


185 








Pro 


Phe 


Val 


Gly 


Gl v 


Lvs 


Tvr 
x y 1 - 


Thr 








220 


Ala 


Gly 


Asp 


Ser 






235 




T . V7 s 
x_i y 


Ser 


TrD 


Asd 




250 






Ala 


Tvr 


Leu 


Arg 


265 








I,vs 


Gl v 


Tvr 


Gl v 


Gl v 

x y 


Tro 

J. X p 


Ala 


Va 1 








300 


Asn 


Trp 


Ala 


Tyr 






315 




y 


Lvs 

xj y 


Me t 


Asd 




330 






Glu 

\ — ) _1_ L4 


Ala 


Le u 


Lys 


34 5 








Ala 


Val 


Thr 


Phe 


Tvr 


Pro 


Ala 


Tyr 








380 


Phe 


Tyr 


Arg 


Asp 






395 




Leu 

J_J \hf* W* 


lie 


Tro 


He 




4 10 






Tvr 
y 


Tvr 


Asp 


Ser 


425 








Lys 


Pro 


Gl v 


Leu 


Arg 


Trp 


Val 


Tyr 








460 


Thr 


Gly 


Asn 


Leu 






475 




Tro 

1 X ^ 


Val 


Tvr 


Leu 




490 






Gl v 


Tvr 

i y x 


Ser 


Val 


505 








He 


Leu 


Glu 


Ala 


Glu 


Leu 


He 


Ser 








b 4 U 


Arg 


Leu 


Thr 


Gly 






555 




Trp 


Asp 


Pro 


Ser 




570 








Page 


18 





I le 


Trp 

•*• ^ IT — 


lie 


Pro 


125 








Gl v 


Tyr 


Asp 


Pro 


Gly 


Thr 


Val 


Glu 








160 


lie 


Asn 


Thr 


TV 1 

Ala 






175 




lie 


Asn 


His 


Arg 




190 






Asp 


Tyr 


Thr 


Trp 


205 








Ala 


Asn 


Tyr 


Leu 


Gl v 


Th r 


Phe 


Gly 








240 


Gin 


Tyr 


Trp 


T . m 

Leu 






255 




Ser 


lie 


Gly 


He 




270 






Ala 


Trp 


Val 


Val 


285 








Gly 


Glu 


Tyr 


Trp 


Ser 


Ser 


Gly 


Ala 








320 


Ala 


Ala 


Phe 


Asp 






335 




Asn 


Gly 


Gly 


Thr 




350 






Val 


Ala 


Asn 


His 


365 








Ala 


Phe 


lie 


Leu 


Tyr 


Glu 


Glu 


Trp 








400 


His 


Asp 


Asn 


-r 

Leu 






415 




Asp 


Glu 


Met 


lie 




430 






lie 


Thr 


Tyr 


lie 


445 








Val 


Pro 


Lys 


Phe 


Gly 

mi 


Gly 


Trp 


Val 








480 


Glu 


Ala 


Pro 


Ala 






495 




Trp 


Ser 


Tyr 


Cys 




510 






/» O Up' 


A r a 


Va 1 


Leu 


525 








Gly 


lie 


Ala 


Arg 


He 


Thr 


Gly 


lie 








560 


Arg 


Asp 


Lys 


Tyr 






575 
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He Ala Val Lys 

580 



A n 

Msn 


LyS 


pi n 


A 1 a 

Mia 










T 1 a 
lie 


D v- 


LcU 


v a i 




Dl U 






rtsp 


v a i 


rje L 


A 1 -i 

Mia 


\J (L. ~) 








V a J. 


oin 


lie 


v a l 


Leu 


Met 


Ser 


Ala 








D O V 


Lys 


r n e 


ms n 


A 1 j 

mi a 






D / D 




Leu 


Ax a 


v a i 


i nr 




d y u 






(j i y 


Me t 




l yr 










Vol 


MSp 


T H v- 

i nr 


lie 


Ser 


Val 


Asp 


Cys 










Thr 


Thr 


Leu 


Gin 






755 




Glu 


Met 


Val 


Arg 




770 






Ala 


Lys 


Asn 


Trp 


785 








Glu 


Pro 


Gly 


Val 


Val 


Ala 


Ala 


Pro 



820 



Tyr Asp Val Ser 

Leu Gin Ala Glu 

600 

Ala Phe He Gly 
615 

Ala Ala He Pro 
630 

Leu Leu Gly Thr 
645 

Glu Glu Lys Phe 

Ala Leu Ala His 

680 

Ser Arg Phe Glu 
695 

Gly Thr Pro Cys 
710 

He Glu Gly Lys 
725 

Asn Val Val Glu 

Arg Ala He Lys 

760 

Asn Cys Met He 
7 75 

Glu Asn Val Leu 
790 

Glu Gly Glu Glu 
805 



Thr Ala Val Glu 
585 

Val Gly Leu Pro 

Arg Leu Glu Glu 

620 

Gin Leu Met Glu 
635 

Gly Lys Lys Lys 
650 

Pro Gly Lys Val 
665 

His He Met Ala 

Pro Cys Gly Leu 

700 

Ala Cys Ala Ser 
715 

Thr Gly Phe His 
730 

Pro Ala Asp Val 
745 

Val Val Gly Thr 

Gin Asp Leu Ser 

780 

Leu Ser Leu Gly 
795 

He Ala Pro Leu 
810 



Ala Lys Ala Leu 
590 

Val Asp Arg Asn 
605 

Gin Lys Gly Pro 

Met Val Glu Asp 

640 

Phe Glu Arg Met 
655 

Arg Ala Val Val 
670 

Gly Ala Asp Val 
685 

He Gin Leu Gin 

Thr Gly Gly Leu 

720 

Met Gly Arg Leu 
735 

Lys Lys Val Ala 
750 

Pro Ala Tyr Glu 
765 

Trp Lys Gly Pro 

Val Ala Gly Gly 

800 

Ala Lys Glu Asn 
815 



<210> 17 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 17 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

15 10 15 

Ala Thr Ser 



<210> 18 
<211> 444 
<212> PRT 

<213> Thermotoga maritima 
<400> 18 

Met Ala Glu Phe Phe Pro Glu He Pro Lys He Gin Phe Glu Gly Lys 

15 10 15 

Glu Ser Thr Asn Pro Leu Ala Phe Arg Phe Tyr Asp Pro Asn Glu Val 
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20 

lie Asp Gly Lys 
35 

Trp His Thr Phe 

50 

Ala Glu Arg Pro 
65 

Ala Arg Va 1 Asp 

Tyr Phe Cys Phe 

100 

Arg Glu Thr Asn 
115 

Arg Met Lys Asp 
130 

Phe Ser His Pro 
145 

Asp Val Phe Ala 

Thr Lys Glu Leu 

180 

Gly Tyr Glu Thr 
195 

Leu Ala Arg Phe 
210 

Phe Thr Gly Gin 
225 

His Gin Tyr Asp 

His Gly Leu Asp 

260 

Thr Leu Ala Gly 
275 

Leu Gly Lys Leu 
290 

Gly Trp Asp Thr 
305 

Ala Met Tyr Glu 

Asn Phe Asp Ala 

340 

Phe lie Gly His 
355 

He Ala Tyr Lys 
370 

Glu Lys Tyr Arg 
385 

Gly Lys Thr Asp 

Asp He Glu Leu 

420 

Asn Ser Tyr lie 
435 



<210> 19 
<211> 1335 
<212> DNA 



Pro Leu Lys Asp 

40 

Val Asn Glu Gly 
55 

Trp Asn Arg Phe 
70 

Ala Leu Phe Glu 
85 

His Asp Arg Asp 

Lys He Leu Asp 

120 

Ser Asn Val Lys 
135 

Arg Tyr Met His 
150 

Tyr Ala Ala Ala 
165 

Gly Gly Glu Gly 

Leu Leu Asn Thr 

200 

Leu Arg Met Ala 
215 

Phe Leu He Glu 

230 

Phe Asp Val Ala 
245 

Glu Tyr Phe Lys 

His Thr Phe Gin 

280 

Gly Ser lie Asp 
295 

Asp Gin Phe Pro 
310 

Val He Lys Ala 
325 

Lys Val Arg Arg 

He Ala Gly Met 

360 

Leu Ala Lys Asp 
375 

Ser Phe Lys Glu 
390 

Phe Glu Lys Leu 
405 

Pro Ser Gly Lys 

Val Lys Thr He 

440 



25 



HIS 


Leu 


Lys 


D K q 

r ne 


Arg 


Asp 


Pro 


Phe 








60 


Q o r 

o e it 


MS p 


t 1 o 








1 D 




r ne 


tys 


C 1 1 1 
o 1 U 


Lys 




y U 






T ) r 

lie 


Mia 


D v- e~\ 

r r o 


o 1 U 


1 U J 








T i i r 

j_i y s 


Val 


w ^ 1 
V a X 


o 1 U 


Leu 


Leu 


Trp 


Gly 








140 


b 1 y 


mi a 


A 1 - 

Mia 


Th k 






l j j 




bin 


Val 


T \;c 

Lys 


j_» y o 




1 / U 






I yr 


va i 


r n e 


l rp 


IOC 

lob 








Asp 


Leu 


b i y 


Leu 


Val 


Glu 


Tyr 


Ala 








220 


tro 


LyS 


D v /-\ 
t L O 


Lys 






9 S 




T h >- 

i n r 


Ala 
Mid 


T \/ r 

i y r 


Ala 
nid 




z ju 






Php 
n ic 


Den 
/noil 


He 


Glu 


9 £ ^ 
Z D D 








n x o 


u x u 


i-j C; KJ. 


Arn 


Ala 


Asn 


Gin 


Gly 








300 


1 I 1 L 


Aon 

Moll 


T 1 p 


T v r 






j l j 




C-, 1 V 
u i y 


nj x y 


Phe 


Thr 




J JU 






A 1 A 




T v r 
i yx 


T ■ v ^ 


j^i J 








A o r\ 
Mop 


Thr 


n ic 


Al a 
/i x a 


Gly 


Val 


Phe 


Asp 








380 


Gly 


lie 


Gly 


Lys 






395 




Glu 


Glu 


Tyr 


He 




4 10 






Gin 


Glu 


Tyr 


Leu 


425 








Ala 


Glu 


Leu 


Arg 




Page 


20 







i n 






Cor 


v a x 


A 1 ^ 
/i x a 


Php 

IT J 1 ~ 


4 5 








r: 1 \ / 
oiy 


M O J-J 


Pro 


Thr 


Mop 




Ala 


Phe 








80 


Leu 


Asn 


He 


Glu 










Gly 


Lys 


Thr 


7 oil 




i 1 n 

1 1 u 






A >^ ^-f 

Mr g 


Tin 

lie 


Lys 


C 1 n 


1 2 S 

X C. ^ 








Thr 
1 III 


A 1 A 
M X a 


A c; n 
/ i O I 1 


T .p« 1 1 

1_J LJ 


Thr 
1 1 1 X 


^ys 


Q <=i r 


Ala 






160 


Ala 


Leu 


Glu 


He 






1 7 S 

X / w> 




b i y 


uiy 


Arn 

mx y 


Ul u 




X .7 \J 






fin 
b 1 U 


Lc U 


ul U 


Moll 


£. U D 








Lys 


T wo 

Lys 


T 1 <=> 


ui y 


nin 


Pro 


Thr 


T . v s 

J — > y 








240 


Phe 


Leu 


Lys 


Asn 






255 




A 1 a 
m x a 


it O 1 1 


His 

I 1 -L O 


Ala 




270 








Ala 
x a 


Ar a 


lie 

X -L >w 


2ft 5 

L U J 








A c: n 


T <2> 1 1 


T pn 

LiC U 


T.pi] 

XJ V>1 


B cr, 
o ^ 


Thr 


Thr 


T ,p u 

JLJ L4 








320 


Lys 


Gly 


Gly 


Leu 






335 

~J *J 




u= l 

v a J- 


Glu 


A ? o 


Le u 




350 






T pn 


Gl v 
ox y 


Phe 


T .v s 


J U J 








Lys 


D h i=i 

r ne 


T 1 Q 

lie 


H 1 \ i 
b 1 U 


Glu 


He 


Val 


Glu 








400 


He 


Asp 


Lys 


Glu 






4 15 




Glu 


Ser 


Leu 


Leu 




430 
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<213> Thermotoga maritima 



<400> 19 

atggccgagt tcttcccgga gatcccgaag atccagttcg agggcaagga gtccaccaac 60 

ccgctcgcct tccgcttcta cgacccgaac gaggtgatcg acggcaagcc gctcaaggac 120 

cacctcaagt tctccgtggc cttctggcac accttcgtga acgagggccg cgacccgttc 180 

ggcgacccga ccgccgagcg cccgtggaac cgcttctccg acccgatgga caaggccttc 240 

gcccgcgtgg acgccctctt cgagttctgc gagaagctca acatcgagta cttctgcttc 300 

cacgaccgcg acatcgcccc ggagggcaag accctccgcg agaccaacaa gatcctcgac 360 

aaggtggtgg agcgcatcaa ggagcgcatg aaggactcca acgtgaagct cctctggggc 420 

accgccaacc tcttctccca cccgcgctac atgcacggcg ccgccaccac ctgctccgcc 480 

gacgtgttcg cctacgccgc cgcccaggtg aagaaggccc tggagatcac caaggagctg 540 

ggcggcgagg gctacgtgtt ctggggcggc cgcgagggct acgagaccct cctcaacacc 600 

gacctcggcc tggagctgga gaacctcgcc cgcttcctcc gcatggccgt ggagtacgcc 660 

aagaagatcg gcttcaccgg ccagttcctc atcgagccga agccgaagga gccgaccaag 720 

caccagtacg acttcgacgt ggccaccgcc tacgccttcc tcaagaacca cggcctcgac 780 

gagtacttca agttcaacat cgaggccaac cacqccaccc tcgccggcca caccttccag 840 

cacgagctgc gcatggcccg catcctcggc aagctcggct ccatcgacgc caaccagggc 900 

gacctcctcc tcggctggga caccgaccag ttcccgacca acatctacga caccaccctc 960 

gccatgtacg aggtgatcaa ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc 1020 

aaggtgcgcc gcgcctccta caaggtggag gacctcttca tcggccacat cgccggcatg 1080 

gacaccttcg ccctcggctt caagatcgcc tacaagctcg ccaaggacgg cgtgttcgac 1140 

aagttcatcg aggagaagta ccgctccttc aaggagggca tcggcaagga gatcgtggag 1200 

ggcaagaccg acttcgagaa gctggaggag tacatcatcg acaaggagga catcgagctg 1260 

ccgtccggca agcaggagta cctggagtcc ctcctcaact cctacatcgt gaagaccatc 1320 
gccgagctgc gctga 1335 

<210> 20 
<211> 4 4 4 
< 2 1 2 > PRT 

<213> Thermotoga neapolitana 
<400> 20 

Met Ala Glu Phe Phe Pro Glu lie Pro Lys Val Gin Phe Glu Gly Lys 

15 10 15 

Glu Ser Thr Asn Pro Leu Ala Phe Lys Phe Tyr Asp Pro Glu Glu lie 

20 25 30 

lie Asp Gly Lys Pro Leu Lys Asp His Leu Lys Phe Ser Val Ala Phe 

35 40 45 

Trp Kis Thr Phe Val Asn Glu Gly Arg Asp Pro Phe Gly Asp Pro Thr 

50 55 60 

Ala Asp Arg Pro Trp Asn Arg Tyr Thr Asp Pro Met Asp Lys Ala Phe 
65 70 75 80 

Ala Arg Val Asp Ala Leu Phe Glu Phe Cys Glu Lys Leu Asn lie Glu 

85 90 95 

Tyr Phe Cys Phe His Asp Arg Asp He Ala Pro Glu Gly Lys Thr Leu 

100 105 110 

Arg Glu Thr Asn Lys He Leu Asp Lys Val Val Glu Arg He Lys Glu 

115 120 125 

Arg Met Lys Asp Ser Asn Val Lys Leu Leu Trp Gly Thr Ala Asn Leu 

130 135 140 

Phe Ser His Pro Arg Tyr Met His Gly Ala Ala Thr Thr Cys Ser Ala 
145 150 155 160 

Asp Val Phe Ala Tyr Ala Ala Ala Gin Val Lys Lys Ala Leu Glu He 

165 170 175 

Thr Lys Glu Leu Gly Gly Glu Gly Tyr Val Phe Trp Gly Gly Arg Glu 

180 185 190 

Gly Tyr Glu Thr Leu Leu Asn Thr Asp Leu Gly Phe Glu Leu Glu Asn 
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195 










200 


Leu 


Ala 


Arg 


Phe 


Leu 


Arg 


Met 


Ala 




210 








215 




Phe 


Thr 


Gly 


Gin 


Phe 


Leu 


He 


Glu 


225 










2 30 






His 


Gin 


Tyr 


Asp 


Phe 
2 4 5 


Asp 


Va 1 


TV 1 _ 

Ala 


His 


Gly 


Leu 


Asp 
260 


Glu 


Tyr 


Phe 


T 

Lys 


Thr 


Leu 


Ala 


Gly 


His 


Thr 


Phe 


Gin 






275 








280 


Leu 


Gly 


Lys 


Leu 


Gly 


Ser 


He 


Asp 




290 








295 




Gly 


Trp 


Asp 


Thr 


Asp 


Gin 


Phe 


Pro 


305 










310 






Ala 


Met 


Tyr 


Glu 


Val 


He 


Lys 


Ala 








325 








Asn 


Phe 


Asp 


Ala 
34 0 


Lys 


mm ^ 

Val 


Arg 


Arg 


Phe 


He 


Gly 
355 


His 


He 


Ala 


Gly 


Met 

360 


Val 


Ala 


Tyr 


Lys 


Leu 


Val 


Lys 


Asp 




370 








375 




Glu 


Lys 


Tyr 


Arg 


Ser 


Phe 


Arg 


Glu 


385 








390 






Gly 


Lys 


Val 


Asp 


Phe 


Glu 


Lys 


Leu 








405 








Thr 


He 


Glu 


Leu 

420 


Pro 


Ser 


Gly 


Lys 


Asn 


Ser 


Tyr 
435 


He 


Val 


Lys 


Thr 


He 
440 



205 



Val 


Asp 


Tyr 


Ala 

£. <i U 


Lys 


Arg 


He 


Gly 


Pro 


Lys 


Pro 


Lys 


Glu 


Pro 


Thr 


Lys 




o o c 










s> a o 


Thr 


Ala 


Tyr 


Ala 


Phe 


Leu 


Lys 


Ser 




2 5 0 








£ Z) +j 




Phe 


Asn 


lie 


fin 


M 1 <3 


Ben 
r\ O 1 1 


n jl o 


A 1 a 


n r c 

2 65 










9 7 0 






HIS 




Lie u 


A y- /~t 
MlCj 


9 ft ^ 
o D 


A 1 ^ 




T le 


Ala 


As n 


bin 


l> i y 

~i C\C\ 

J I) U 


MS p 


T oil 


JU c u 


T . O M 


Thr 


Asn 


Val 

J 1 J 


Tyr 


Asp 


Thr 


Thr 


Leu 

._> £ w 


Gly 


Gly 


r ne 


i n r 


Ly5 


oi y 


uiy 


T pii 


O "5 A 

330 










7 "3 c > 
j j j 




Ala 


Ser 


Tyr 


Lys 


v a l 


o 1 U 


Ren 
MSp 


T oil 


34 5 










■3C.O 

J jU 






Asp 


i n r 


O V-^ A-v 

rne 


Mia 


LcU 


o i y 


Php 


by j 








"3 6 s 

J D J 








biy 


Val 


1j t: U 


380 


T,\;c 
L/y 3 


Php 


He 


Glu 


Gly 


He 


Gly 


Arg 


Asp 


He 


Val 


Glu 




395 










400 


Glu 


Glu 


Tyr 


He 


He 


Asp 


Lys 


Glu 




410 








4 15 




Gin 


Glu 


Tyr 


Leu 


Glu 


Ser 


Leu 


He 


425 










4 30 






Leu 


Glu 


Leu 


Arg 











<210> 21 
<211> 1335 
<212> DNA 

<213> Thermotoga neapolitana 



<400> 21 

atggccgagt tcttcccgga gatcccgaag gtgcagttcg agggcaagga gtccaccaac 

ccgctcgcct tcaagttcta cgacccggag gagatcatcg acggcaagcc gctcaaggac 120 

cacctcaagt tctccgtggc cttctggcac accttcgtga aegagggecg cgacccgttc 180 



60 



ggcgacccga ccgccgaccg cccgtggaac cgctacaccg acccgatgga caaggccttc 240 
gcccgcgtgg acgccctctt cgagttctgc gagaagctca acatcgagta cttctgcttc 300 
cacgaccgcg acatcgcccc ggagggcaag accctccgcg agaccaacaa gatcctcgac 360 



aaggtggtgg agegcatcaa ggagcgcatg aaggactcca aegtgaaget cctctggggc 420 



gacgtgttcg cctacgccgc cgcccaggtg aagaaggece tggagatcac caaggagctg d4U 
ggeggegagg gctacgtgtt ctggggcggc cgegaggget acgagaccct cctcaacacc 600 
gacctcggct tcgagctgga gaacctcgcc cgcttcctcc gcatggccgt ggactacgcc o6u 



aagegcateg gcttcaccgg ccagttcctc ategagcega agecgaagga gccgaccaag 720 

caccagtacg acttcgacgt ggccaccgcc tacgccttcc tcaagtccca cggcctcgac 780 

gagtacttca agttcaacat cgaggccaac cacgccaccc tcgccggcca caccttccag 840 

cacgagctgc gcatggcccg catcctcggc aagctegget ccatcgacgc caaccagggc 900 

gacctcctcc tcggctggga caccgaccag ttcccgacca acgtgtacga caccaccctc 960 

gccatgtacg aggtgatcaa ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc 1020 

aaggtgcgcc gcgcctccta caaggtggag gacctcttca tcggccacat cgccggcatg 1080 

gacaccttcg ccctcggctt caaggtggcc tacaagctcg tgaaggaegg cgtgctcgac 1140 
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28 



29 



aagttcatcg aggagaagta ccgctccttc cgcgagggca tcggccgcga catcgtggag 1200 

ggcaaggtgg acttcgagaa gctggaggag tacatcatcg acaaggagac catcgagctg 1260 

ccgtccggca agcaggagta cctggagtcc ctcatcaact cctacatcgt gaagaccatc 1320 

ctggagctgc gctga 1335 

<210> 22 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 22 

agcgaattca tggcggctct ggccacgt 

<210> 23 
<211> 2 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 23 

agctaagctt cagggcgcgg ccacgttct 

<210> 24 
<211> 825 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> synthetic 
<400> 24 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

15 10 15 

Ala Thr Ser Ala Gly His Trp Tyr Lys His Gin Arg Ala Tyr Gin Phe 

20 25 30 

Thr Gly Glu Asp Asp Phe Gly Lys Val Ala Val Val Lys Leu Pro Met 

35 40 45 

Asp Leu Thr Lys Val Gly lie lie Val Arg Leu Asn Glu Trp Gin Ala 

50 55 60 

Lys Asp Val Ala Lys Asp Arg Phe He Glu He Lys Asp Gly Lys Ala 
65 70 75 80 

Glu Val Trp He Leu Gin Gly Val Glu Glu He Phe Tyr Glu Lys Pro 

85 90 95 

Asp Thr Ser Pro Arg He Phe Phe Ala Gin Ala Arg Ser Asn Lys Val 

100 105 110 

He Glu Ala Phe Leu Thr Asn Pro Val Asp Thr Lys Lys Lys Glu Leu 

115 120 125 

Phe Lys Val Thr Val Asp Gly Lys Glu He Pro Val Ser Arg Val Glu 

130 135 140 

Lys Ala Asp Pro Thr Asp lie Asp Val Thr Asn Tyr Val Arg He Val 
145 150 155 160 

Leu Ser Glu Ser Leu Lys Glu Glu Asp Leu Arg Lys Asp Val Glu Leu 

165 170 175 
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T 1 0 

lie 


T 1 c 

lie 


La 1 U 


vj x y 








180 


Asp 


Asp 


Tyr 


Tyr 






X 2? >J 




Lys 


I n r 


T 1 o 

lie 


Pho 
rile 




Z X U 






leu 


LcU 


tr 1 1 e 


T w c: 
i_i y 0 










Mo t- 

ne u 


fin 

\s: 1 U 


T v r~ 

i yr 


by 0 


Leu 


MSp 




v ct x 








260 


Arg 


Thr 


Thr 


Val 






97c, 

Z / D 




Glu 


O x-^ -w- 

be r 


A 1 = 

Ax a 


v a 1 










Asn 


ASp 


Arg 


ui y 


jU j 








<o 1 U 


T T o 

lie 


n x 


T 1 0 

lie 


J\ r> n 

ms n 


Li y 5 


oiy 


T oil 








340 


Gly 


Gly 


Val 


Thr 






J J J 




His 


Val 


HIS 


lie 




J> / u 






ASp 


i_»y i3 


A S p 




IRS 








rr 11 e 




V^ 1 


pro 


TK r* 
1 I 1 L 


A >- n 


Tie 


A r a 








420 


Gly 


He 


Gly 


Val 






4 35 




bi y 


u i u 




oc 1 




A SO 






lie 


Mop 


Xj y 0 


Th r 


H O -J 








T 1 A 

lie 


A 1 ;=) 
n 1 d 


O *— X 


"O J- u 


l 1 1 1 


i yi 


Trn 

1 J. p 


Va 1 








500 


Met 


Gly 


Leu 


lie 










HIS 




T 1 0 


Acn 




»J _> \J 






IIP 


ri 

uly 


A 1 a 


Pro 
r 1 u 


^ ^ ^ 








va i 


Mia 


A 1 A 
Mia 


Ph 0 


?he 


Asn 


r r 0 


C 0 v 








580 


Thr 


Lys 


He 


Lys 






595 




Leu 


lie 


Lys 


Ser 




610 






Ala 


Cys 


His 


Asp 


625 









T v r 

1 yx 


T,vS 


Pro 


Ala 


T v r 
1 y x 


AS D 


Gly 


Glu 






200 


Arg 


Val 


Trp 


Ser 






215 






Gl v 


Glu 


Asp 




230 






G 1 v 


Asn 


Gl v 


Val 


2 4 5 








Phe 


T vr 

1 y x. 


Leu 


Tvr 


A s 0 


Pro 


T vr 


Ser 








280 


Val 


Asn 


Leu 


Ala 






295 

*s 




Pro 




T1p 

X U- w 


Glu 




310 






Ala 


As D 


He 


Thr 


32 5 








T vr 

1 y x 


Leu 


Gl v 


Leu 


Thr 

1111 


Gl v 

-x, y 


Leu 


Ser 








360 


Leu 


Pro 


Phe 


Phe 






37 5 






ij y j 


T vr 


Tvr 




390 






Glu 


Gl v 


Arq 


Tyr 


4 05 








Glu 


Val 


Lys 


Glu 


lie 


Met 


Asp 


Met 








440 


Ala 


Phe 


Asp 


Gin 






4"55 




Gl v 


Ala 


T vr 

y 


Leu 




4 70 






Arg 


Pro 


Met 


Met 


4 85 








Lvs 


Glu 


Tyr 


His 


Asp 


Lys 


Lys 


Thr 








520 


Pro 


Thr 


He 


He 






535 




I "i e 


Ar a 


Phe 


Gly 




550 








As D 


Glu 


Phe 


5 65 

\J w/ 








v d x 


Lj y 0 


ui y 


Phe 

L 1 1 ^Z* 


Arg 


Gly 


Val 


Val 








600 


Phe 


Ala 


Leu 


Asp 






615 




Asn 


His 


Thr 


Leu 




630 







Ara 


Val 


lie 


Met 


185 








Leu 


Gly 


Ala 


Val 


Pro 


Val 


Ser 


Lys 








220 


Thr 


Glu 


Pro 


Tyr 






235 




Trp 


Glu 


Ala 


Val 




250 






Gin 


Leu 


Glu 


Asn 


265 








Lvs 


Ala 


Val 


Tyr 


Arg 


Thr 


Asn 


Pro 








300 


Gly 


Tyr 


Glu 


Asp 






315 




Gl V 


Leu 


Glu 


Asn 




330 






Thr 


Glu 


Glu 


Asn 


34 5 








His 


Leu 


Val 


Glu 


Asp 


Phe 


Tyr 


Thr 








380 


Asn 


Trp 


Gly 


Tyr 






395 




Ser 


Thr 


Asp 


Pro 




410 






Met 


Val 


Lys 


Ala 


425 








Val 


Phe 


Pro 


His 


Thr 


Val 


Pro 


Tyr 








460 


Asn 


Glu 


Ser 


Gly 






475 




Arg 


Lys 


Phe 


lie 




490 






lie 


Asp 


Gly 


Phe 


505 








Met 


Leu 


Glu 


Val 


Leu 


Tyr 


Gly 


Glu 








540 


Lys 


Ser 


Asp 


Va 1 






555 




Arg 


Asp 


Ala 


lie 




570 






Val 


Met 


Gly 


Gly 


s ft S 
JO J 








Gly 


Ser 


lie 


Asn 


Pro 


Glu 


Glu 


Thr 








620 


Trp 


Asp 


Lys 


Asn 






635 






Page 


24 





Met 


Glu 


He 


Leu 




190 






Tyr 


Ser 


Pro 


Glu 


205 








Trp 


Val 


Lys 

mi 


Val 


Gin 


Val 


Val 


Asn 








240 


Va 1 


Glu 


Gly 


ASp 






255 




Tyr 


Gly 


Lys 

ml 


He 




270 






Ala 


Asn 


Asn 


Gin 


285 








Glu 


Gly 


Trp 


Glu 


Ala 


He 


lie 


Tyr 








320 


Ser 


r* i . ■ 

Gly 


Va 1 


Lys 






335 




Thr 


Lys 


Ala 


Pro 




350 






Leu 


Gly 


Val 


Thr 


365 








Gly 


Asp 


Glu 


Leu 


Asp 


Pro 


Tyr 


Leu 








400 


T . . — 

Lys 


Asn 


Pro 


HIS 






4 15 




Leu 


His 


Lys 


His 




4 30 






Thr 


Tyr 


Gly 


lie 


445 








Tyr 


Phe 


Tyr 


Arg 


Cys 


Gly 


Asn 


Val 








480 


Va 1 


Asp 


rnr 


v a 1 






4 95 




Arg 

ml 


Phe 


Asp 


Gin 




510 






Glu 


Arg 


Ala 


Leu 


525 








Pro 


Trp 


Gly 


Gly 


Ala 


Gly 

mi 


Thr 


His 








560 


Arg 


Gly 


Ser 


Val 






575 




1 y x 


Gl v 


Lvs 


Glu 




590 






Tyr 


Asp 


Gly 


Lys 


605 








lie 


Asn 


Tyr 


Ala 


Tyr 


Leu 


Ala 


Ala 






640 
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Lys 


Ala 


Asp 


Lys 


Lys 


Lys 


Glu 


Trp 


Thr 


Glu 


Glu 


Glu 


Leu 


Lys 


Asn 


Ala 




64 5 










650 










655 




Gin 


Lys 


Leu 


Ala 


Gly 


Ala 


lie 


Leu 


Leu 


Thr 


Ser 


Gin 


Gly 


Val 


Pro 


Phe 






660 








665 










670 






Le u 


His 


Gl v 


Gly 


Gin 


Asp 


Phe 


Cys 


Arg 


Thr 


Thr 


Asn 


Phe 


Asn 


Asp 


Asn 






675 








680 










685 








Se r 


Tvr 


Asn 


Ala 


Pro 


lie 


Ser 


He 


Asn 


Gly 


Phe 


Asp 


Tyr 


Glu 


Arg 


Lys 




690 










695 










700 








Leu 


Leu 


G 1 n 


Phe 


lie 


Asp 


Val 


Phe 


Asn 


Tyr 


His 


Lys 


Gly 


Leu 


He 


Lys 


705 








710 










715 








He 


720 


Arg 


Lvs 


Glu 


His 


Pro 


Ala 


Phe 


Arg 


Leu 


Lys 


Asn 


Ala 


Glu 


Glu 


Lys 






725 










730 










735 




Lvs 


His 


Leu 


Glu 


Phe 


Leu 


Pro 


Gly 


Gly 


Arg 


Arg 


He 


Val 


Ala 


Phe 


Met 






740 










745 










750 






Leu 


Lys 


Asp 


His 


Ala 


Gly 


Gly 


Asp 


Pro 


Trp 


Lys 


Asp 


He 


Val 


Val 


He 




755 










760 










765 




Gly 




Tvr 


Asn 


Gly 


Asn 


Leu 


Glu 


Lys 


Thr 


Thr 


Tyr 


Lys 


Leu 


Pro 


Glu 


Lys 


770 








775 










780 








Glu 


Trp 


Asn 


Val 


Val 


Val 


Asn 


Ser 


Gin 


Lys 


Ala 


Gly 


Thr 


Glu 


Val 


lie 


785 










790 










795 








Val 


800 


Thr 


Val 


Glu 


Gly 


Thr 


lie 


Glu 


Leu 


Asp 


Pro 


Leu 


Ser 


Ala 


Tyr 


Leu 








805 










810 










815 




Tyr 


Arg 


Glu 


Ser 


Glu 


Lys 


Asp 


Glu 


Leu 


















820 










825 

















<210> 25 
<211> 2478 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 25 

atgagggtgt tgctcgttgc cctcgctctc ctggctctcg ctgcgagcgc caccagcgct 60 

ggccactggt acaagcacca gcgcgcctac cagttcaccg gcgaggacga cttcgggaag 120 

gtggccgtgg tgaagctccc gatggacctc accaaggtgg gcatcatcgt gcgcctcaac 180 

gagtggcagg cgaaggacgt ggccaaggac cgcttcatcg agatcaagga cggcaaggcc 240 

gaggtgtgga tactccaggg cgtggaggag atcttctacg agaagccgga cacctccccg 300 

cgcatcttct tcgcccaggc ccgctccaac aaggtgatcg aggccttcct caccaacccg 360 

gtggacacca agaagaagga gctgttcaag gtgaccgtcg acggcaagga gatcccggtg 420 

tcccgcgtgg agaaggccga cccgaccgac atcgacgtga ccaactacgt gcgcatcgtg 480 

ctctccgagt ccctcaagga ggaggacctc cgcaaggacg tggagctgat catcgagggc 540 

tacaagccgg cccgcgtgat catgatggag atcctcgacg actactacta cgacggcgag 600 

ctgggagcgg tgtactcccc ggagaagacc atcttccgcg tgtggtcccc ggtgtccaag 660 

tgggtgaagg tgctcctctt caagaacggc gaggacaccg agccgtacca ggtggtgaac 720 

atagagtaca agggcaacgg cgtgtgggag gccgtggtgg agggcgacct cgacggcgtg 780 

ttctacctct accagctgga gaactacggc aagatccgca ccaccgtgga cccgtactcc 840 

aaggccgtgt acqccaacaa ccaggagtct gcagtggtga acctcgcccg caccaacccg 900 

gagggctggg agaacgaccg cggcccgaag atcgagggct acgaggacgc catcatctac 960 

gagatccaca tcgccgacat caccggcctg gagaactccg gcgtgaagaa caagggcctc 1020 

tacctcggcc tcaccgagga gaacaccaag gccccgggcg gcgtgaccac cggcctctcc 1080 

cacctcgtgg agctgggcgt gacccacgtg cacatcctcc cgttcttcga cttctacacc 1140 

ggcgacgagc tggacaagga cttcgagaag tactacaact ggggctacga cccgtacctc 1200 

ttcatggtgc cggagggccg ctactccacc gacccgaaga acccgcacac ccgaattcgc 1260 

gaggtgaagg agatggtgaa ggccctccac aagcacggca tcggcgtgat catggacatg 1320 

gtgttcccgc acacctacgg catcggcgag ctgtccgcct tcgaccagac cgtgccgtac 1380 
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tacttctacc gcatcgacaa gaccggcgcc tacctcaacg agtccggctg cggcaacgtg 1440 

atcgcctccg agcgcccgat gatgcgcaag ttcatcgtgg acaccgtgac ctactgggtg 1500 

aaggagtacc acatcgacgg cttccgcttc gaccagatgg gcctcatcga caagaagacc 1560 

atgctggagg tggagcgcgc cctccacaag atcgacccga ccatcatcct ctacggcgag 1620 

ccgtggggcg gctggggggc cccgatccgc ttcggcaagt ccgacgtggc cggcacccac 1680 

gtggccgcct tcaacgacga gttccgcgac gccatccgcg gctccgtgtt caacccqtcc 1740 

gtgaagggct tcgtgatggg cggctacggc aaggagacca agatcaagcg cggcgtggtg 1800 

ggctccatca actacgacgg caagctcatc aagtccttcg ccctcgaccc gqaggagacc 1860 

atcaactacg ccgcctgcca cgacaaccac accctctggg acaagaacta cctcgccgcc 1920 

aaggccgaca agaagaagga gtggaccgag gaggagctga agaacgccca gaagctcgcc 1980 

ggcgccatcc tcctcactag tcagggcgtg ccgttcctcc acggcggcca ggacttctgc 2040 

cqcaccacca acttcaacga caactcctac aacqccccqa tctccatcaa cggcttcgac 2100 

tacgagcgca agctccagtt catcgacgtg ttcaactacc acaagggcct catcaagctc 2160 

cgcaaggagc acccqqcctt ccgcctcaag aacgccgagg agatcaagaa gcacctggag 2220 

ttcctcccgg gcgggcgccg catcgtggcc ttcatgctca aggaccacgc cggcggcgac 2280 

ccgtggaagg acatcgtggt gatctacaac ggcaacctgg agaagaccac ctacaagctc 2340 

ccggagggca agtggaacgt ggtggtgaac tcccagaagg ccggcaccga ggtgatcgag 2400 

accqtqqaqg ocaccatcqa gctggacccg ctctccgcct acgtgctcta ccgcgagtcc 2460 

gagaaggacg agctgtga ^4/0 

<210> 26 
<211> 718 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> synthetic 



<400> 26 



Met 


Arg 


Val 


Leu 


Leu 
5 


Val 


Ala 


Leu 


1 

Ala 


Thr 


Ser 


Met 


Glu 


Thr 


He 


Lys 








20 










Lys 


Val 


Val 


He 


Gly 


Glu 


Pro 


Phe 




35 










40 


Gin 


Lys 


He 


Ser 


Ser 


Asn 


Lys 


Ser 




50 










55 




Val 


Gin 


Gin 


Gly 


Asn 


Lys 


Val 


He 


65 










70 






Glu 


His 


He 


lie 


Gly 


Leu 


Gly 


Glu 










85 








Arg 


Lys 


Arg 


Tyr 


Val 


Met 


Tyr 


Asn 








100 










Tyr 


Gin 


Asp 


Pro 


Leu 


Tyr 


Val 


Ser 




115 










120 


Asp 


Gly 


Val 


Ala 


Thr 


Gly 


Tyr 


Phe 


130 










135 




Phe 


Asp 


Val 


Gly 


Leu 


Glu 


Glu 


Tyr 


14 5 








150 






Glu 


Asp 


Ser 


Val 


Glu 


Phe 


Tyr 


Val 








165 








Val 


Leu 


Glu 


Lys 


Tyr 


Thr 


Glu 


Leu 








180 










Met 


Trp 


Ala 


Phe 


Gly 


Tyr 


Met 


lie 




195 










200 


Asp 


Lys 


Val 


Val 


Glu 


Leu 


Val 


Asp 


210 










215 




Val 


Ala 


Gly 


Val 


Phe 


Leu 


Asp 


He 



Ala 


Leu 


Leu 


Ala 


Leu 


Ala 


Ala 


Ser 




10 










15 




He 


Tyr 


Glu 


Asn 


Lys 


Gly 


Val 


Tyr 


25 










30 






Pro 


Pro 


lie 


Glu 


Phe 
45 


Pro 


Leu 


Glu 


Leu 


Ser 


Glu 


Leu 


Gly 


Leu 


Thr 


lie 








60 










Val 


Glu 


Lys 


Ser 


Leu 


Asp 


Leu 


Lys 






75 










80 


Lys 


Ala 


Phe 


Glu 


Leu 


Asp 


Arg 


Lys 


90 










95 




Val 


Asp 


Ala 


Gly 


Ala 


Tyr 


Lys 


Lys 


105 










110 






lie 


Pro 


Leu 


Phe 


He 


Ser 


Val 


Lys 










125 








Phe 


Asn 


Ser 


Ala 


Ser 


Lys 


Val 


lie 








140 










Asp 


Lys 


Val 


lie 


Val 


Thr 


He 


Pro 


155 










160 


lie 


Glu 


Gly 


Pro 


Arg 


lie 


Glu 


Asp 




170 










175 




Thr 


Gly 


Lys 


Pro 


Phe 


Leu 


Pro 


Pro 


185 










190 






Ser 


Arg 


Tyr 


Ser 


Tyr 


Tyr 


Pro 


Gin 










205 








lie 


Met 


Gin 


Lys 


Glu 


Gly 


Phe 


Arg 








220 










His 


Tyr 


Met 


Asp 


Ser 


Tyr 


Lys 


Leu 
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225 

Phe Thr Trp His 

Glu Leu His Lys 

260 

lie Arg Val Asp 

275 

Phe Cys Glu Tie 
290 

Gly Thr Thr Val 

305 

Trp Ala Gly Leu 

Trp Leu Asp Met 

340 

Arg Asp Val Leu 
355 

Val Thr Thr Phe 
370 

Val Lys His Glu 

385 

Ala Thr Phe Lys 

Leu Ser Arg Ala 

420 

Thr Gly Asp Asn 
435 

Leu Val Leu Gly 
450 

He Gly Gly Phe 
465 

Asp Leu Leu Val 

Arg Ser His Lys 

500 

Pro Asp Tyr Tyr 
515 

Lys Phe Leu Pro 
530 

Gly His Pro Val 
545 

Asp Met Tyr Arg 

Tyr Ala Pro He 

580 

Arg Gly Lys Trp 

595 

Ser Val Val Lys 
610 

Ser He He Pro 
625 

Ser Phe Lys Arg 

lie Lys Phe Ser 

660 

Glu Lys Pro Val 
675 

Val Glu Lys Thr 



230 

Pro Tyr Arg Phe 

245 

Arg Asn Val Lys 

Gin Asn Tyr Ser 

280 

Glu Ser Gly Glu 
295 

Tyr Pro Asp Phe 
310 

He Ser Glu Trp 

325 

Asn Glu Pro Thr 

Ser Ser Leu Pro 

360 

Pro Asp Asn Val 
375 

Lys Val Arg Asn 
390 

Gly Phe Arg Thr 
405 

Gly Tyr Ala Gly 

Thr Pro Ser Trp 

4 40 

Leu Ser He Ser 
455 

Gin Gly Arg Asn 
470 

Lys Tyr Tyr Ala 
485 

Ala Thr Asp Gly 

Lys Glu Lys Val 

520 

Tyr lie Tyr Ser 
535 

lie Arg Pro Leu 
550 

lie Glu Asp Glu 
565 

Val Ser Lys Glu 

Tyr Asn Tyr Trp 

600 

Ser Thr His Glu 
615 

Leu Glu Gly Asp 
630 

Tyr Asp Asn Ala 
645 

Arg Glu He Tyr 

Ser Lys lie lie 

680 

Met Gin Asn Thr 



235 

Pro Glu Pro Lys 
250 

Leu lie Thr lie 

265 

Pro Phe Leu Ser 

Leu Phe Val Gly 

300 

Phe Arg Glu Asp 
315 

Leu Ser Gin Gly 
330 

Asp Phe Ser Arg 
345 

Val Gin Phe Arg 

Val His Tyr Leu 

380 

Ala Tyr Pro Leu 
395 

Ser His Arg Asn 
410 

lie Gin Arg Tyr 
425 

Asp Asp Leu Lys 

Gly Val Pro Phe 

460 

Phe Ala Glu lie 
475 

Leu Ala Leu Phe 
490 

lie Asp Thr Glu 
505 

Lys Glu lie Val 

Leu Ala Leu Glu 

540 

Phe Tyr Glu Phe 
555 

Tyr Met Val Gly 

570 

Glu Ser Arg Leu 
585 

Asn Gly Glu lie 

Leu Pro lie Tyr 

620 

Glu Leu lie Val 
635 

Glu lie Thr Ser 
650 

Val Ser Lys Leu 
665 

Val Asp Asp Ser 
Tyr Val Ala Lys 



240 

Lys Leu lie Asp 
255 

Val Asp His Gly 
270 

Gly Met Gly Lys 
285 

Lys Met Trp Pro 

Thr Arg Glu Trp 

320 

Val Asp Gly lie 
335 

Ala lie Glu lie 
350 

Asp Asp Arg Leu 
365 

Arg Gly Lys Arg 

Tyr Glu Ala Met 

400 

Glu He Phe lie 
4 15 

Ala Phe lie Trp 
430 

Leu Gin Leu Gin 

445 

Val Gly Cys Asp 

Asp Asn Ser Met 

480 

Phe Pro Phe Tyr 
495 

Pro Val Phe Leu 
510 

Glu Leu Arg Tyr 

525 

Ala Ser Glu Lys 

Gin Asp Asp Asp 

560 

Lys Tyr Leu Leu 
575 

Val Thr Leu Pro 
590 

lie Asn Gly Lys 
605 

Leu Arg Glu Gly 

Tyr Gly Glu Thr 

640 

Ser Ser Asn Glu 
655 

Thr lie Thr Ser 

670 

Lys Glu lie Gin 
685 

lie Asn Gin Lys 
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690 

He Arg Gly Lys 

705 



<210> 27 
<211> 712 
<212> PRT 
<213> Artificia 

<220> 

<223> synthetic 



<400> 27 



Met 
1 


Arg 


Val 


Leu 


Ala 


Thr 


Ser 


Met 








20 


Lys 


Val 


Va 1 


-w- 1 

lie 




3 0 




Gin 


Lys 


lie 


Ser 










Va 1 


Gin 


Gin 


r i ,, 
biy 


65 








Glu 


His 


He 


He 


Arg 


Lys 


Arg 


Tyr 








inn 


Tyr 


Gin 


Asp 


n ^h. 

Pro 






1 1 0 




Asp 


Gly 


Va 1 


Ala 




i on 
1 jU 






Phe 


Asp 


val 


bi y 


14 5 








Glu 


Asp 


Ser 


Val 


Val 


Leu 


Glu 


Lys 








180 


Met 


Trp 


Ala 


Phe 






195 




Asp 


Lys 


Val 


Val 




210 






Val 


Ala 


Gly 


Val 


225 








Phe 


Thr 


Trp 


His 


Glu 


Leu 


His 


Lys 








260 


He 


Arg 


Val 


Asp 






275 




Phe 


Cys 


Glu 


He 




290 






Gly 


Thr 


Thr 


Val 


305 








Trp 


Ala 


Gly 


Leu 


Trp 


Leu 


Asp 


Met 



340 



695 

He Asn Leu Glu 
710 



Sequence 



T 

Leu 


Va 1 


Ala 


Leu 


5 








Glu 


Thr 


He 


Lys 


Gly 


Glu 


X~% W% 

Pro 


rne 








a n 


C* y-, -w- 

Ser 


Asn 


i»y ^ 


Car 

o s r 






C. C 

□ <J 




As n 


Lys 


v a i 


T 1 *=> 




7 n 






<j1 y 


u 


ui y 




85 








Val 


Met 


Tyr 


Asn 


Leu 


1 yr 


v a i 


Q o r" 










i n r 


biy 


i y r 


DKp 
Lilt; 






135 

X 




Leu 


Pin 
\j x U 


O 1 u 


y 




1 SO 

X 






o 1 U 


tile 


T \/ r 
1 yi 


V^ 1 


1 a c 








Tyr 


Thr 


Glu 


Leu 


Gly 


Tyr 


Met 


He 








200 


Glu 


Leu 


Val 


Asp 






215 




Phe 


Leu 


Asp 


lie 




230 






Pro 


Tyr 


Arg 


Phe 


24 5 








Arg 


Asn 


Val 


Lys 


Gin 


Asn 


Tyr 


Ser 








280 


Glu 


Ser 


Gly 


Glu 






295 




Tyr 


Pro 


Asp 


Phe 




310 






He 


Ser 


Glu 


Trp 


325 








Asn 


Glu 


Pro 


Thr 



700 

Ser Glu Lys Asp 
715 



Ala Leu Leu Ala 
10 

He Tyr Glu Asn 
25 

Pro Pro He Glu 

Leu Ser Glu Leu 

60 

Val Glu Lys Ser 
75 

Lys Ala Phe Glu 
90 

Val Asp Ala Gly 
105 

He Pro Leu Phe 

Phe Asn Ser Ala 

14 0 

Asp Lys Val lie 
155 

He Glu Gly Pro 
170 

Thr Gly Lys Pro 
185 

Ser Arg Tyr Ser 

He Met Gin Lys 

220 

His Tyr Met Asp 
235 

Pro Glu Pro Lys 
250 

Leu He Thr He 
265 

Pro Phe Leu Ser 

Leu Phe Val Gly 

300 

Phe Arg Glu Asp 
315 

Leu Ser Gin Gly 
330 

Asp Phe Ser Arg 
345 



Glu Leu 



T on 


A 1 ^ 
r\ X d 


A 1 a 

xi X CI 


Ser 






X 




T \ / c: 
by o 


ui y 


Va 1 

V CI X 


Tvr 




30 






r nu 


Pro 


Le u 


Glu 


45 








Gly 


Leu 


Thr 


He 


T u 


Asd 


Leu 


Lys 








80 


Leu 


Asp 


Arg 


Lys 






95 




Ala 


Tvr 


Lys 


Lys 




110 






He 


Ser 


Val 


Lys 


125 








Ser 


Lys 


Val 


He 


Va 1 


Thr 


I le 


Pro 








160 


Arg 


He 


Glu 


Asp 






175 






T 1 1 

Xj ~ u 


Pro 

L X w 


Pro 




190 






Tyr 


Tyr 


Pro 


Gin 


205 








Glu 


Gly 


Phe 


Arg 


Ser 


Tyr 


Lys 


Leu 








240 


Lys 


Leu 


He 


Asp 






255 




Val 


Asp 


His 


Gly 




270 






Gly 


Met 


Gly 


Lys 


285 








Lys 


Met 


Trp 


Pro 


Thr 


Arg 


Glu 


Trp 








320 


Val 


Asp 


Gly 


lie 






335 




Ala 


lie 


Glu 


lie 




350 
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A r n 


A en 


Va 1 


l—i C 






_> _/ vJ 




Va 1 


Thr 


Thr 


Phe 




1 7 n 






V a J. 




H i s 

I 1 i. o 


Glu 


~ ft S 

_j O J 








A "I a 


Thr 


Phe 

L J 1 


Lvs 


Leu 


Ser 


Arg 


Ala 








420 


J. J 1 -L 


o i y 




Asn 






4 1 S 

4 J — 1 




l_i \Z Li 


Va 1 


T,pn 


Gl V 




4 SO 

4 *J VJ 






T 1 p 


n 1 V 
ui y 


Gl v 


Phe 

Lit ^_ - 


4 ft S 

4 O .J 








A S D 


Leu 


Leu 


Val 


Arg 


Ser 


His 


Lys 








500 


Pro 




i y x 


T v r 
i y x. 






S 1 S 

ji. j 




T v s 


Phe 


Leu 


Pro 




S 30 

J J u 






ri u 
ox y 


His 


Pro 


Va 1 


S 4 S 








fieri 


Met 


T v r 


Arg 


Tyr 


Ala 


Pro 


He 






580 


A >- n 
MI Lj 


Gly 


by o 


T r d 






S Q S 
j j 




o C I 


Va 1 

vox 


Va 1 


Lvs 




ft 1 0 






O t. i- 


He 


He 


Pro 


v 








Ser 


Phe 


Lys 


Arg 


He 


Lys 


Phe 


Ser 






660 


Glu 


Lys 


Pro 


Val 






675 




Val 


Glu 


Lys 


Thr 




690 






lie 


Arg 


Gly 


Lys 


705 









<210> 28 
<211> 469 
<212> PRT 
<213> Artificial 

<220> 

<223> synthetic 

<400> 28 

Met Arg Val Leu 



Ser 


Ser 


Leu 


Pro 








360 


Pro 


Asp 


Asn 


Val 






375 




Lvs 

— j y ' — j 


Val 


Arq 


Asn 




390 






Gl v 


Phe 


Arg 


Thr 


4 0 5 








Gly 


Tyr 


Ala 


Gly 


Thr 


Pro 


Ser 


Trp 








4 4 0 


Leu 


Ser 


He 


Ser 






4 55 




Gin 


Gly 


Arg 


Asn 




4 70 






Lvs 


Tyr 


Tyr 


Ala 


485 








Ala 


Thr 


Asp 


Gly 


Lys 


Glu 


Lys 


Val 








520 


Tyr 


He 


Tyr 


Ser 






535 




He 


Arg 


Pro 


Leu 




550 






He 


Glu 


Asp 


Glu 


565 








Val 


Ser 


Lys 


Glu 


Tyr 


Asn 


Tyr 


Trp 






r r\ r\ 

600 


Ser 


Thr 


His 


Glu 






615 




Le u 


Glu 


Gly 


Asp 




630 






Tyr 


Asp 


Asn 


Ala 


64 5 








Arg 


Glu 


He 


Tyr 


Ser 


Lys 


He 


He 








680 


Met 


Gin 


Asn 


Thr 






695 




He 


Asn 


Leu 


Glu 




710 







Sequence 



Leu Val Ala Leu 



Val Gin Phe Arg 

Val His Tyr Leu 

380 

Ala Tyr Pro Leu 
395 

Ser His Arg Asn 
4 10 

lie Gin Arg Tyr 
425 

Asp Asp Leu Lys 

Gly Val Pro Phe 

4 60 

Phe Ala Glu He 
475 

Leu Ala Leu Phe 
490 

He Asp Thr Glu 

505 

Lys Glu He Val 

Leu Ala Leu Glu 

540 

Phe Tyr Glu Phe 
555 

Tyr Met Val Gly 
570 

Glu Ser Arg Leu 
585 

Asn Gly Glu lie 

Leu Pro lie Tyr 

620 

Glu Leu He Val 
635 

Glu lie Thr Ser 
650 

Val Ser Lys Leu 
665 

Val Asp Asp Ser 

Tyr Val Ala Lys 

700 



Ala Leu Leu Ala 



Asp 


Asp 


Arg 


Leu 


365 








Arg 


Gly 


Lys 


Arg 


Tyr 


Glu 


Ala 


Met 








4 no 


Glu 


lie 


Phe 


lie 






415 




Ala 


Phe 


lie 


Trp 




4 30 






Leu 


Gin 


Leu 


Gin 


4 4 5 








Val 


Gly 


Cys 


Asp 


Asp 


Asn 


Ser 


Met 








d ft n 

4 0 U 


Phe 


Pro 


Phe 


Tyr 






495 




Pro 


Val 


Phe 


Leu 




510 






Glu 


Leu 


Arg 


Tyr 


525 








Ala 


Ser 


Glu 


Lys 


Gin 


Asp 


Asp 


Asp 








S 60 

J u w 


Lys 


Tyr 


Leu 


Leu 






575 




Val 


Thr 


Leu 


Pro 




590 






lie 


Asn 


Gly 


Lys 


605 








Leu 


Arg 


Glu 


Gly 


Tyr 


Gly 


Glu 


Thr 








640 


Ser 


Ser 


Asn 


Glu 






655 




Thr 


lie 


Thr 


Ser 




670 






Lys 


Glu 


lie 


Gin 


685 








lie 


Asn 


Gin 


Lys 



Leu Ala Ala Ser 
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Ala 


Thr 


Ser 


Met 








20 


Glu 


G 1 v 


j_j y *— > 


Glu 






3 5 




i l O 1 1 


Glu 


Va 1 


He 

>*L -k- "w 




50 






Va 1 


Ala 

i J -k_ W* 


Phe 


Trp 


65 








As D 

li ^-J f^-f 


Pro 


Thr 


Ala 


Lys 


Ala 


Phe 


Ala 






100 


Asn 


T 1 p 

X X 


Glu 


T vr 






1 1 s 




j-j y o 


Thr 


XJ i-J 


Ara 




1 30 

x _> w 






He 


Ly s 


Glu 


Arg 


1 4 5 








n x o 




I jP u 


Phe 


Cys 


Ser 


Ala 


Asp 






180 


T oil 
i_j tr u 


Gin 

O X U 


Tie 


Thr 

x J L X. 






1 95 




Gl V 


Ara 


Glu 

-^_J -L. \^-4 


Gl V 




2 10 






T P ij 


Glu 


Asn 


Le u 


225 








Ly s 


He 


Gly 


Phe 


Pro 


Thr 


Lys 


His 








260 


T on 
j_j vzr L-* 


j / y «-_> 


Asn 


His 






27 5 




Asn 


His 


Ala 


Thr 




2 90 






Ala 


Arg 


He 


Leu 


305 

W 








Leu 


Leu 


Leu 


Gly 


Thr 


Thr 


Leu 


Ala 








34 0 


Gl v 


Gl v 


Leu 


Asn 






355 




Glu 


Asp 


Leu 


Phe 




37 0 






Gl v 


Phe 


L v s 


lie 


38 5 








Phe 


He 


Glu 


Glu 


lie 


Val 


Glu 


Gly 








420 


Asp 


Lys 


Glu 


Asp 






435 




Ser 


Leu 


Leu 


Asn 




450 






Glu 


Lys 


Asp 


Glu 



Ala 


Glu 


Phe 


Phe 


Ser 


Thr 


Asn 


Pro 








H V 


Asp 


Gly 


Lys 


Pro 






55 




His 


Thr 


Phe 


Val 




70 






Glu 


Arg 


Pro 


Trp 


85 








Arg 


Val 


Asp 


Ala 


Phe 


Cys 


Phe 


His 






1 O (""1 

120 


Glu 


Thr 


Asn 


Lys 






135 




Met 


Lys 


Asp 


Ser 




150 






Ser 


His 


Pro 


Arg 


165 








Val 


Phe 


Ala 


Tyr 


Lys 


Glu 


Leu 


Gly 








o n n 
zUU 


Tyr 

-* 


Glu 


Thr 


Leu 




215 




Ala 


Arg 


Phe 


Leu 




230 






Thr 


Gly 


Gin 


Phe 


245 








Gin 


Tyr 


Asp 


Phe 


Gly 


Leu 


Asp 


Glu 








o q n 


Leu 


Ala 


Gly 


His 






295 




Gly 


Lys 


Leu 


Gly 




310 






Trp 


Asp 


Thr 


Asp 


325 








Met 


Tyr 


Glu 


Val 


Phe 


Asp 


Ala 


Lys 








^ & D 


He 


Gly 


His 


lie 






375 




Ala 


Tyr 


Lys 


Leu 




390 






Lys 


Tyr 


Arg 


Ser 


405 








Lys 


Thr 


Asp 


Phe 


lie 


Glu 


Leu 


Pro 








440 


Ser 


Tyr 


lie 


Val 






455 





Leu 





10 






Pro 


Glu 


lie 


Pro 


25 








Leu 


Ala 


Phe 


Arg 


Leu 


Lys 


Asp 


His 








60 


as n 


olLl 


o i y 


Arg 






75 




Asn 


Arg 


Phe 


Ser 




90 






Leu 


Phe 


Glu 


Phe 


105 








Asp 


Arg 


Asp 


lie 


lie 


Leu 


Asp 


Lys 








140 


Asn 


vai 


Lys 


Le u 






155 




Tyr 


Met 


His 


Gly 




170 






Ala 


Ala 


Ala 


Gin 


185 








Gly 


Glu 


Gly 


Tyr 


Leu 


Asn 


Thr 


Asp 








220 


Arg 


Me t 


Aia 


v a i 






235 




Leu 


lie 


Glu 


Pro 




250 






Asp 


Val 


Ala 


Thr 


265 








Tyr 


Phe 


Lys 


Phe 


Thr 


Phe 


Gin 


His 








300 


C* v> 

be r 


lie 


AS p 


Ala 






315 




Gin 


Phe 


Pro 


Thr 




330 






lie 


Lys 


Ala 


Gly 


345 








Val 


Arg 


Arg 


Ala 


Ala 


Gly 


Met 


Asp 








380 


Ala 


Lys 


AS p 


Lj i y 






395 




Phe 


Lys 


Glu 


Gly 




4 10 






Glu 


Lys 


Leu 


Glu 


425 








O C; X 


Gl v 

\_> x y 


T ■ v 

x-i _y o 


Gin 


Lys 


Thr 


lie 


Ala 






4 60 




Page 


30 





Lys 


He 


Gin 


Phe 


30 






Phe 


Tyr 


Asp 


Pro 


45 








Leu 


Lys 


Phe 


Ser 


Asp 


Pro 


Phe 


Gly 








ft 0 


Asp 


Pro 


Met 


Asp 






95 




Cys 


Glu 


Lys 


Leu 




110 






Ala 


Pro 


Glu 


Gly 


125 








Val 


Val 


Glu 


Arg 


Leu 


Trp 


Gly 


Thr 








160 


Ala 


Ala 


Thr 


Thr 






175 




Val 


Lys 


Lys 


Ala 




190 






Val 


Phe 


Trp 


Gly 


205 








Leu 


Gly 


Leu 


Glu 


Glu 


Tyr 


Ala 


Lys 








2 4 0 


Lys 


Pro 


Lys 


Glu 






255 




Ala 


Tyr 


Ala 


Phe 




270 






Asn 


lie 


Glu 


Ala 


285 








Glu 


Leu 


Arg 


Met 


Asn 


Gin 


Gly 


Asp 








320 


Asn 


lie 


Tyr 


Asp 






335 




Gly 


Phe 


Thr 


Lys 




350 






Ser 


Tyr 


Lys 


Val 


365 








Thr 


Phe 


Ala 


Leu 


Val 


Phe 


Asp 


Lys 








400 


lie 


Gly 


Lys 


Glu 






4 15 




Glu 


Tyr 


lie 


lie 




430 






Glu 


Tyr 


Leu 


Glu 


445 








Glu 


Leu 


Arg 


Ser 
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4 65 



<210> 29 
<211> 469 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 29 



Met 


Arg 


Va 1 


Leu 


Leu 


v a i 


A 1 A 


T <=> 1 1 

LiC U 


Ala 


Leu 

-i—i \w« 


1 






c. 
-> 










10 


Ala 


1 n r 


C o v 


Mo f- 

Z U 


r\ i el 




Phe 


Phe 


Pro 

2 5 


Glu 


Glu 


Gly 


Lys 


Glu 


Ser 


Thr 


Asn 


Pro 


Leu 


Ala 














4 0 






Glu 


blU 

D u 


T 1 


T 1 P 


Aqn 


Gl v 


Lys 
55 


Pro 


Leu 


Lys 


v a i 


Mia 


r 1 1 fcr 


Tv-n 
1 I p 


His 

n i. o 


Thr 


Phe 


Val 


Asn 


Glu 












70 








Arg 


Asp 


Dv- n 


Thr 


Ala 


As D 


Ar q 


Pro 


Trp 


Asn 








Q S 
O U 










90 


Lys 


A 1 n 

Mia 


Phe 




A r a 


Val 


Asp 


Ala 


Leu 


Phe 






i on 










105 




A e n 

ms n 


He 


m ii 

\j i u 


T v r 


Phe 


C vs 


Phe 


His 


Asp 


Arg 






1 1 R 
1 1 j 








120 






Lys 


Thr 


Leu 


Arg 


Glu 


Thr 


Asn 


Lys 


He 


Leu 




i 7 n 










135 






Val 


1 -L c 


Lys 




Ar a 


Me t 


LVS 


Asp 


Ser 


Asn 










150 






• 




Ala 


Aon 
Moll 


l_i c u 


Phe 


Ser 


His 


Pro 


Arg 


Tyr 


Met 








16 5 










170 


Cys 


Ser 


Ala 


Asp 


Val 


Phe 


Ala 


Tyr 


Ala 


Ala 






180 

_L w V-x 










185 




Leu 


Glu 


He 

a. 

1 Z> -J 


Thr 


Lvs 


Glu 


Leu 


Gly 
200 


Gly 


Glu 


Gly 


Arg 


Glu 


Gly 


Tyr 


Glu 


Thr 


Leu 


Leu 


Asn 


210 










215 








Leu 


Glu 


Asn 


Leu 


Ala 


Arg 


Phe 


Leu 


Arg 


Met 


225 










230 








He 


Arg 


lie 


Gly 


Phe 


Thr 


Gly 


Gin 


Phe 


Leu 






24 5 










250 


Pro 


Thr 


Lys 


His 


Gin 


Tyr 


Asp 


Phe 


Asp 


Val 






260 










265 




Leu 


Lys 


Ser 


His 


Gly 


Leu 


Asp 


Glu 


Tyr 


Phe 




275 










280 






Asn 


His 
290 


Ala 


Thr 


Leu 


Ala 


Gly 
295 


His 


Thr 


Phe 


Ala 


Arg 


He 


Leu 


Gly 


Lys 


Leu 


Gly 


Ser 


He 


305 








310 








Phe 


Leu 


Leu 


Leu 


Gly 


Trp 


Asp 


Thr 


Asp 


Gin 








325 










330 


Thr 


Thr 


Leu 


Ala 
340 


Met 


Tyr 


Glu 


Val 


He 
34 5 


Lys 


Gly 


Gly 


Leu 


Asn 


Phe 


Asp 


Ala 


Lys 


Val 


Arg 


355 










360 
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T.pn 

1— J \ — - 'wJ 


Ala 


Leu 


Ala 


Ala 


Ser 










15 




He 

-i- 


Pro 


Lvs 


Val 


Gin 


Phe 








30 






Phe 


Lvs 


Phe 


Tyr 


Asp 


Pro 






4 5 








Asp 


His 


Leu 


Lys 


Phe 


Ser 




60 










Gly 


Arg 


Asp 


Pro 


Phe 


Gly 


75 










80 


T vr 


Thr 


Asp 

IT 


Pro 


Met 


Asp 








95 




Glu 


Phe 


Cys 


Glu 


Lys 


Leu 








110 






Asp 


He 


Ala 


Pro 


Glu 


Gly 




125 








Asp 


Lys 


Val 


Val 


Glu 


Arg 




140 










Lys 


Leu 


Leu 


Trp 


Gly 


Thr 


155 










160 


His 


Gly 


Ala 


Ala 


Thr 


Thr 








175 




Ala 


Gin 


Val 


Lys 


Lys 


Ala 








190 






Gly 


Tyr 


Val 


Phe 


Trp 


Gly 






205 








Thr 


Asp 


Leu 


Gly 


Phe 


Glu 




220 










Ala 


Val 


Asp 


Tyr 


Ala 


Lys 


235 










240 


Glu 


Pro 


Lys 


Pro 


Lys 


Glu 










255 




Ala 


Thr 


Ala 


Tyr 


Ala 


Phe 








270 






Lys 


Phe 


Asn 


He 


Glu 


Ala 




285 








Gin 


His 


Glu 


Leu 


Arg 


Met 




300 










Asp 


Ala 


Asn 


Gin 


Gly 


Asp 


315 










320 


Pro 


Thr 


Asn 


Val 


Tyr 


Asp 










335 




Ala 


Gly 


Gly 


Phe 


Thr 


Lys 








350 






Arg 


Ala 


Ser 


Tyr 


Lys 


Val 



365 
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Glu 


Asp 
370 


Leu 


Phe 


lie 


Gly 


His 

375 


lie 


Gly 


Phe 


Lys 


Val 


Ala 


Tyr 


Lys 


Leu 


385 










390 






Phe 


lie 


Glu 


Glu 


Lys 
405 


Tyr 


Arg 


Ser 


lie 


Val 


Glu 


Gly 
420 


Lys 


Val 


Asp 


Phe 


Asd 


Lys 


Glu 


Thr 


He 


Glu 


Leu 


Pro 


435 










4 40 


Ser 


Leu 
450 


He 


Asn 


Ser 


Tyr 


He 
455 


Val 


Glu 


Lys 


Asp 


Glu 


Leu 









465 



<210> 30 
<211> 463 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 30 



Met 


Arg 


Val 


Leu 


Leu 
5 


Val 


Ala 


Leu 


1 

Ala 


Thr 


Ser 


Met 


Ala 


Glu 


Phe 


Phe 








20 










Glu 


Gly 


Lys 


Glu 


Ser 


Thr 


Asn 


Pro 




35 










40 


Glu 


Glu 


He 


He 


Asp 


Gly 


Lys 


Pro 




50 










55 




Val 


Ala 


Phe 


Trp 


His 


Thr 


Phe 


Val 


65 










70 






Asp 


Pro 


Thr 


Ala 


Asp 


Arg 


Pro 


Trp 








85 








Lys 


Ala 


Phe 


Ala 


Arg 


Val 


Asp 


Ala 






100 










Asn 


He 


Glu 


Tyr 


Phe 


Cys 


Phe 


His 






115 








120 


Lys 


Thr 


Leu 


Arg 


Glu 


Thr 


Asn 


Lys 


130 










135 




He 


Lys 


Glu 


Arg 


Met 


Lys 


Asp 


Ser 


145 








150 






Ala 


Asn 


Leu 


Phe 


Ser 


His 


Pro 


Arg 










165 








Cys 


Ser 


Ala 


Asp 


Val 


Phe 


Ala 


Tyr 






180 










Leu 


Glu 


He 


Thr 


Lys 


Glu 


Leu 


Gly 






195 








200 


Gly 


Arg 


Glu 


Gly 


Tyr 


Glu 


Thr 


Leu 


210 










215 




Leu 


Glu 


Asn 


Leu 


Ala 


Arg 


Phe 


Leu 


225 










230 






Arg 


He 


Gly 


Phe 


Thr 


Gly 


Gin 


Phe 








245 








Pro 


Thr 


Lys 


His 


Gin 


Tyr 


Asp 


Phe 



Ala 


Gly 


Met 


Asp 
380 


Thr 


Phe 


Ala 


Leu 


Val 


Lys 


Asp 


Gly 


Val 


Leu 


Asp 


Lys 




395 










400 


Phe 


Arg 
4 10 


Glu 


Gly 


He 


Gly 


Arg 
4 15 


Asp 


Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


He 


He 


425 








430 






Ser 


Gly 


Lys 


Gin 


Glu 
445 


Tyr 


Leu 


Glu 


Lys 


Thr 


He 


Leu 


Glu 


Leu 


Arg 


Ser 






460 











Ala 


Leu 
10 


Leu 


Ala 


Leu 


Ala 


Ala 
15 


Ser 


Pro 


Glu 


He 


Pro 


Lys 


Val 


Gin 


Phe 


25 










30 






Leu 


Ala 


Phe 


Lys 


Phe 
45 


Tyr 


Asp 


Pro 


Leu 


Lys 


Asp 


His 
60 


Leu 


Lys 


Phe 


Ser 


Asn 


Glu 


Gly 
75 


Arg 


Asp 


Pro 


Phe 


Gly 
80 


Asn 


Arg 
90 


Tyr 


Thr 


Asp 


Pro 


Met 
95 


Asp 


Leu 


Phe 


Glu 


Phe 


Cys 


Glu 


Lys 


Leu 


105 










110 






Asp 


Arg 


Asp 


He 


Ala 

125 


Pro 


Glu 


Gly 


He 


Leu 


Asp 


Lys 
140 


Val 


Val 


Glu 


Arg 


Asn 


Val 


Lys 
155 


Leu 


Leu 


Trp 


Gly 


Thr 
160 


Tyr 


Met 


His 


Gly 


Ala 


Ala 


Thr 


Thr 


170 










175 




Ala 


Ala 


Ala 


Gin 


Val 


Lys 


Lys 


Ala 


185 










190 






Gly 


Glu 


Gly 


Tyr 


Val 
205 


Phe 


Trp 


Gly 


Leu 


Asn 


Thr 


Asp 
220 


Leu 


Gly 


Phe 


Glu 


Arg 


Met 


Ala 


Val 


Asp 


Tyr 


Ala 


Lys 




235 










240 


Leu 


He 
250 


Glu 


Pro 


Lys 


Pro 


Lys 
255 


Glu 


Asp 


Val 


Ala 


Thr 


Ala 


Tyr 


Ala 


Phe 
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260 



Leu 


Lys 


Ser 


His 


Gly 


Leu 


Asp 


Glu 




275 










280 


Asn 


His 

290 


Ala 


Thr 


Leu 


Ala 


Gly 
295 


His 


Ala 


Arg 


lie 


Leu 


Gly 


Lys 


Leu 


Gly 


305 








310 






Leu 


Leu 


Leu 


Gly 


Trp 
325 


Asp 


Thr 


Asp 


Thr 


Thr 


Leu 


Ala 
340 


Met 


Tyr 


Glu 


Val 


Gly 


Gly 


Leu 


Asn 


Phe 


Asp 


Ala 


Lys 


355 










360 


Glu 


Asp 
370 


Leu 


Phe 


He 


Gly 


His 
375 


He 


Gly 


Phe 


Lys 


Val 


Ala 


Tyr 


Lys 


Leu 


38 5 










390 






Phe 


lie 


Glu 


Glu 


Lys 
405 


Tyr 


Arg 


Ser 


lie 


Val 


Glu 


Gly 
420 


Lys 


Val 


Asp 


Phe 


Asp 


Lys 


Glu 


Thr 


He 


Glu 


Leu 


Pro 


435 










440 


Ser 


Leu 
450 


lie 


Asn 


Ser 


Tyr 


He 
455 


Val 



265 










270 






Tyr 


Phe 


Lys 


Phe 


Asn 


He 


Glu 


Ala 








285 








Thr 


Phe 


Gin 


His 
300 


Glu 


Leu 


Arg 


Met 


Ser 


He 


Asp 
315 


Ala 


Asn 


Gin 


Gly 


Asp 
320 


Gin 


Phe 

330 


Pro 


Thr 


Asn 


Val 


Tyr 

335 


Asp 


He 


Lys 


Ala 


Gly 


Gly 


Phe 


Thr 


Lys 


345 










350 






Val 


Arg 


Arg 


Ala 


Ser 
365 


Tyr 


Lys 


Val 


Ala 


Gly 


Met 


Asp 
380 


Thr 


Phe 


Ala 


Leu 


Val 


Lys 


Asp 


Gly 


Val 


Leu 


Asp 


Lys 




395 










400 


Phe 


Arg 
410 


Glu 


Gly 


He 


Gly 


Arg 
415 


Asp 


Glu 


Lys 


Leu 


Glu 


Glu 


Tyr 


He 


He 


425 








430 






Ser 


Gly 


Lys 


Gin 


Glu 
445 


Tyr 


Leu 


Glu 


Lys 


Thr 


He 


Leu 


Glu 


Leu 


Arg 





460 



<210> 31 
<21i> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 31 

Met Gly Lys Asn Gly Asn Leu Cys 

1 5 
Leu Ala Gly Leu Ala Ser Gly His 

20 



Cys Phe Ser Leu Leu Leu Leu Leu 
10 15 

Gin 
25 



<210> 32 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 32 

Met Gly Phe Val Leu Phe Ser Gin 

1 5 
Thr Leu Leu Leu Phe Leu Val He 

20 



Leu Pro Ser Phe Leu Leu Val Ser 

10 15 
Ser His Ser Cys Arg Ala 
25 30 



<210> 33 
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<21I> 460 

<212> PRT 

<213> Artificia 

<220> 

<223> synthetic 



<400> 33 



Met 
1 


Arg 


Val 


Leu 


Ala 


Thr 


Ser 


Ala 








20 


Gin 


Ala 


Phe 


Tyr 






35 




I le 


Arg 


Gin 


Lys 




50 ~ 






Trp 


He 


Pro 


Pro 


65 








Tyr 


Asp 


Pro 


Tyr 


Thr 


Va 1 


GlU 


T" 1 V> -v- 

i n r 








1 UU 


Asn 


Thr 


Ala 


His 






115 




Asn 


His 


Arg 


Ala 




130 






Tyr 


Thr 


Ttp 


*~P l—» V 

1 n r 


«» * r*" 

145 








Asn 


Tyr 


Leu 


ASp 


Thr 


Phe 


Gly 


Gly 








loU 


Tyr 

-> 


Trp 

L 


Leu 


Trp 






195 




I le 


Gly 


lie 


As p 




210 






Trp 


va i 


val 


Lys 


225 








Glu 


Tyr 


m _ 

Trp 


Asp 


Ser 


Gly 


Ala 


Lys 








2 60 


Ala 


Phe 


Asp 


Asn 






275 




Gly 


Gly 


Thr 


Val 




290 






Ala 


Asn 


His 


Asp 


305 








Phe 


lie 


Leu 


Thr 


Glu 


Glu 


Trp 


Leu 








340 


Asp 


Asn 


Leu 


Ala 






355 




Glu 


Met 


He 


Phe 




370 






Thr 


Tyr 


He 


Asn 


335 









Sequence 



T 

Leu 


IT "1 

Va 1 


Ala 


Le u 


5 








Lys 


1 y r 


Leu 


(j 1 U 


Trp 


Asp 


Val 


pro 








a n 
41 U 


He 


Pro 


Glu 


Trp 






55 




Ala 


Ser 


Lys 


Gly 




7 0 






Asp 


Tyr 


O T-> £5 

t ne 


M S p 


8 5 








TV v - ^ 

Arg 


r n e 


o ± y 


O J- 


Aia 


I yr 


± y 










1 90 


Gly 


Gly 


Asp 


Leu 






135 




Asp 


O K /-i 

r ne 


Cor 

o e r 


T wo 
Liy d 




1 0 U 






r ne 


HIS 


riO 


A c n 


lob 








I y r 


D >- /~i 
r 1 O 


A o r> 

nop 


lie 


Mid 














? on 


Ala 


Trp 


Arg 


Phe 






215 




MS p 


i rp 


Lit, U 


Acn 




o t n 






T 1 V-> v- 

l nr 


I\ en 

ftsn 


V d _L 


Z\ on 


2 4b 








v a i 


r ne 


nb p 


Php 


Lys 


Asn 


He 


Pro 






280 


Val 


Ser 


Arg 


Asp 






295 




Thr 


Asp 


He 


He 




310 






Tyr 


Glu 


Gly 


Gin 


325 








Asn 


Lys 


Asp 


Lys 


Gly 


Gly 


Ser 


Thr 








360 


Val 


Arg 


Asn 


Gly 






375 




Leu 


Gly 


Ser 


Ser 




390 







Ala Leu Leu Ala 
10 

Leu Glu Glu Gly 
25 

Ser Gly Gly He 

Tyr Asp Ala Gly 

60 

Met Ser Gly Gly 

75 

Leu Gly Glu Tyr 
90 

Lys Gin Glu Leu 
105 

Lys Val He Ala 

Glu Trp Asn Pro 

140 

Val Ala Ser Gly 
155 

Glu Leu His Ala 
170 

Cys His Asp Lys 
185 

Ser Tyr Ala Ala 

Asp Tyr Val Lys 

220 

Trp Trp Gly Gly 

235 

Ala Leu Leu Asn 
250 

Pro Leu Tyr Tyr 
265 

Ala Leu Val Glu 

Pro Phe Lys Ala 

300 

Trp Asn Lys Tyr 
315 

Pro Thr He Phe 
330 

Leu Lys Asn Leu 
345 

Ser He Val Tyr 

Tyr Gly Ser Lys 

380 

Lys Val Gly Arg 
3 95 



Leu 


Ala 


Ala 


Ser 






15 




ui y 


vox 


I le 


Met 




^n 






P 


T r o 


Asn 


Thr 


H D 








T 1 p 


Qpr 


Ala 


He 


T v r 

y 




Met 


Gl v 








8 0 


Tyr 


Gin 


Lys 


Gly 






95 




He 




Met 


He 










As D 


He 


Val 


He 


125 








Phe 


Val 


Gly 


Asp 


L vs 


Tvr 


Thr 


Ala 








160 


Gly 


Asp 


Ser 


Gly 






175 




^p r 




Asd 


Gin 




1 90 






Tvr 


Leu 


Arg 


Ser 


205 

C- \S 








Gl v 


Tvr 


Gly 


Ala 


Trp 


Ala 


Val 


Gly 








240 


Trp 


Ala 


Tyr 


Ser 






255 




L vs 


Met 


Asp 


Ala 




OTA 






Ala 


Leu 


Lys 


Asn 


285 








Val 


Thr 


Phe 


Val 


Pro 


Ala 


Tyr 


Ala 








320 


Tyr 


Arg 


Asp 


Tyr 






335 




He 


Trp 


He 


His 




350 






Tyr 


Asp 


Ser 


Asp 


365 








Pro 


Gly 


Leu 


He 


Trp 


Val 


Tyr 


Val 








400 
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Pro Lys Phe Ala Gly 

405 

Gly Trp Val Asp Lys 

420 

Ala Pro Ala Tyr Asp 
4 35 

Ser Tyr Cys Gly Val 
450 



Ala Cys lie His Glu Tyr 

4 10 

Tyr Val Tyr Ser Ser Gly 

425 

Pro Ala Asn Gly Gin Tyr 
4 40 

Gly Ser Glu Lys Asp Glu 
4 55 



Thr Gly Asn Leu Gly 

4 15 

Trp Val Tyr Leu Glu 
430 

Gly Tyr Ser Val Trp 
445 

Leu 
460 



<210> 34 
<211> 825 
<212> PRT 
<213> Artificial 



Sequence 



<220> 

< 2 2 3 > synthetic 



<400> 34 
Met Arg Val Leu 
1 

Ala Thr Ser Ala 

20 

Thr Gly Glu Asp 
35 

Asp Leu Thr Lys 
50 

Lys Asp Val Ala 
65 

Glu Val Trp He 

Asp Thr Ser Pro 

100 

He Glu Ala Phe 
115 

Phe Lys Val Thr 
130 

Lys Ala Asp Pro 
14 5 

Leu Ser Glu Ser 

He He Glu Gly 

180 

Asp Asp Tyr Tyr 
195 

Lys Thr He Phe 
210 

Leu Leu Phe Lys 

225 

Met Glu Tyr Lys 

Leu Asp Gly Val 

260 

Arg Thr Thr Val 
275 

Glu Ser Ala Val 
290 

Asn Asp Arg Gly 



Leu Val Ala Leu 
5 

Gly His Trp Tyr 

Asp Phe Gly Lys 

40 

Val Gly He He 
55 

Lys Asp Arg Phe 
70 

Leu Gin Gly Val 
85 

Arg He Phe Phe 

Leu Thr Asn Pro 

120 

Val Asp Gly Lys 
135 

Thr Asp He Asp 
150 

Leu Lys Glu Glu 
165 

Tyr Lys Pro Ala 

Tyr Asp Gly Glu 

200 

Arg Val Trp Ser 
215 

Asn Gly Glu Asp 

230 

Gly Asn Gly Val 
245 

Phe Tyr Leu Tyr 

Asp Pro Tyr Ser 

280 

Val Asn Leu Ala 
295 

Pro Lys He Glu 



Ala Leu Leu Ala 
10 

Lys His Gin Arg 
25 

Val Ala Val Val 

Val Arg Leu Asn 

60 

lie Glu lie Lys 
75 

Glu Glu He Phe 
90 

Ala Gin Ala Arg 
105 

Val Asp Thr Lys 

Glu He Pro Val 

140 

Val Thr Asn Tyr 
155 

Asp Leu Arg Lys 
170 

Arg Val lie Met 
185 

Leu Gly Ala Val 

Pro Val Ser Lys 

220 

Thr Glu Pro Tyr 
235 

Trp Glu Ala Val 
250 

Gin Leu Glu Asn 
265 

Lys Ala Val Tyr 

Arg Thr Asn Pro 

300 

Gly Tyr Glu Asp 



Leu Ala Ala Ser 
15 

Ala Tyr Gin Phe 
30 

Lys Leu Pro Met 
45 

Glu Trp Gin Ala 

Asp Gly Lys Ala 

80 

Tyr Glu Lys Pro 
95 

Ser Asn Lys Val 
110 

Lys Lys Glu Leu 
125 

Ser Arg Val Glu 

Val Arg He Val 

160 

Asp Val Glu Leu 
175 

Met Glu lie Leu 
190 

Tyr Ser Pro Glu 
205 

Trp Val Lys Val 

Gin Val Val Asn 

240 

Val Glu Gly Asp 
255 

Tyr Gly Lys He 
270 

Ala Asn Asn Gin 
285 

Glu Gly Trp Glu 
Ala lie lie Tyr 



Page 35 



WO 2005/096804 



PCT/US2004/007182 



305 



Glu 


lie 


His 


lie 


Asn 


Lys 


Gly 


Leu 








3 4 0 


Gly 


Gly 


Val 


Thr 






■1 c c 

Job 




His 


Va 1 


His 


lie 




37 0 






Asp 


Lys 


Asp 


Phe 


3 8 5 








Phe 


Met 


Va 1 


Pro 


Thr 


Arg 


He 


Arg 






4 20 


Gly 


lie 


Gly 


Va 1 






4 3b 




Gly 


Glu 


Leu 


Ser 




4 bU 






lie 


Asp 


Lys 


T 1 l~i v- 

lnr 


4 b b 








lie 


TV 1 - 

Al a 


Ser 


b 1 U 


Thr 


Tyr 


Trp 


Val 








500 


Met 


Gly 


Leu 


I le 






bib 




His 


Lys 


He 


Asp 




b 3 U 






T rp 


Gly 


Ala 


r r o 


b 4 b 








Va l 


Ala 


Ai a 


r n e 


Phe 


Asn 


Pro 


Ser 








con 

b o u 


Thr 


Lys 


He 


Lys 






C Q C. 

b y b 




Leu 


lie 


Lys 


Ser 




D 1 U 






Ala 


Cy s 


HIS 


Asp 


b2 5 








Lys 


Ala 


ASp 


Lys 


Gin 


Lys 


Leu 


Ala 








b bU 


Leu 


His 


Gly 


Gly 






a n 
b / b 




Ser 


Tyr 


Asn 


Ala 










Leu 


Gin 


Phe 


lie 


705 








Arg 


Lys 


Glu 


His 


Lys 


His 


Leu 


Glu 






740 


Leu 


Lys 


Asp 


His 






755 




Tyr 


Asn 


Gly 


Asn 





j] u 






Ala 


ASp 


T 1 <=> 
11c 


l 1 1 i- 


-j o c 

32 b 








Tyr 


LcU 


C 1 w 


T 1 1 

J_)tT Li 


Thr 


Gly 


Leu 


Ser 








360 


Leu 


Pro 


r ne 


Dho 

rne 






TIC 

j> / b 




Glu 


Lys 


Tyr 


1 yr 




D O A 

3 90 






G1U 


Giy 


al g 


l yr 


4 05 








GlU 


val 


L*y s 


ul U 


He 


Met 


Asp 


Met 








440 


Aia 


r ne 


M is p 








/ICC 

41 b b 




Gly 


Ala 


i yr 


T 1 — ^ t i 

LiQ U 




/no 






Arg 


D v- 




Mot* 


4 o b 








Lys 


Pin 
o 1 U 


T 1 \/ y 

i y r 


n x o 


Asp 


Lys 


Lys 


Thr 








520 


r r o 


1 III 


J. lc 
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ASp 


v a i 


tr n tr 


Aon 
M is I 1 




710 






Pro 


Ala 


Phe 


Arg 


725 








Phe 


Leu 


Pro 


Gly 


Ala 


Gly 


Gly 


Asp 








760 


Leu 


Glu 


Lys 


Thr 







315 




Gl v 

\j -L y 


T ,p u 


Glu 


Asn 




130 

J w 






Thr 


Glu 


Glu 

^ ' -iw V—» 


Asn 










His 


T.pn 
i_j 


Va 1 


Glu 


Asp 


Phe 


Tyr 


Thr 








380 


Sen 
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1 x p 
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3 9S 
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Pro 




H X U 
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Va 1 


L vs 


Ala 


4 ? S 








Val 
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Pro 


His 


Thr 


Val 


Pro 


Tyr 








4 60 


As n 


Glu 


Ser 


Gly 






4 7 5 




Ar a 


Lys 


Phe 


He 




4 90 






He 
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Asp 


Gl v 


Phe 


SOS 

w* \J <J 








Met 


Leu 


Glu 


Val 


Leu 


Tyr 


Gly 


Glu 








54 0 




Ser 


Asd 


Val 






555 




Arg 


Asp 


Ala 


lie 




570 






Va 1 


Met 


Gl v 


Gly 


58 5 








Gl v 

ox y 


Ser 


He 


Asn 


Pro 


Glu 


Glu 


Thr 








620 


TrD 


Asp 


L vs 


Asn 






635 




Thr 


Glu 


Glu 


Glu 




650 






Le u 


Thr 


Ser 


Gin 


665 








Arg 


Thr 


Thr 


Asn 


Asn 


Gly 


Phe 


Asp 








700 


T vr 


His 


Lys 


Gly 






715 




Leu 


Lys 


Asn 


Ala 
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Gly 


Arg 


Arg 


lie 


745 








Pro 


Trp 


Lys 


Asp 


Thr 


Tyr 


Lys 


Leu 
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335 
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Ala 


Pro 




350 
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Val 
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365 
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Asp 
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Pro 


Tyr 
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Arg 
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Asn 


Va 1 








480 


Va 1 


Asd 


Thr 


Val 






495 




Ara 


Phe 


Asp 


Gin 




510 






Glu 


Arg 


Ala 


Leu 


525 








Pro 


Trp 


Gly 


Gly 


Ala 


Gly 


Thr 


His 








560 


Arg 


Gly 


Ser 


Val 






575 




T vr 


Gly 


Lys 


Glu 




590 






Tvr 


Asp 


Gly 


Lys 


605 








He 


Asn 


Tyr 


Ala 


Tyr 


Leu 


Ala 


A 1 — 

Ala 






640 


Leu 


Lvs 


Asn 


Ala 






655 




Glv 


Val 


Pro 


Phe 




670 






Phe 


Asn 


Asp 


Asn 


685 








Tvr 


Glu 


Arg 


Lys 


Leu 


He 


Lys 


Leu 








720 


Glu 


Glu 


He 


Lys 






735 




Val 


Ala 


Phe 


Met 




750 






He 


Val 


Val 


lie 


765 








Pro 


Glu 


Gly 


Lys 
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770 

Trp Asn Val Val 

735 

Thr Val Glu Gly 

Tyr Arg Glu Ser 

820 



775 

Val Asn Ser Gin 
790 

Thr He Glu Leu 
805 

Glu Lys Asp Glu 



780 

Lys Ala Gly Thr 
795 

Asp Pro Leu Ser 
810 

Leu 
825 



Glu Val He Glu 

800 

Ala Tyr Val Leu 
815 



<210> 35 
<211> 460 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 35 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 

15 10 15 

Ala Thr Ser Ala Lys Tyr Leu Glu Leu Glu Glu Gly Gly Val He Met 

20 25 30 

Gin Ala Phe Tyr Trp Asp Val Pro Ser Gly Gly He Trp Trp Asp Thr 

35 40 45 

lie Arg Gin Lys He Pro Glu Trp Tyr Asp Ala Gly He Ser Ala He 

50 55 60 

Trp He Pro Pro Ala Ser Lys Gly Met Ser Gly Gly Tyr Ser Met Gly 
65 70 75 80 

Tyr Asp Pro Tyr Asp Tyr Phe Asp Leu Gly Glu Tyr Tyr Gin Lys Gly 

85 90 95 

Thr Val Glu Thr Arg Phe Gly Ser Lys Gin Glu Leu lie Asn Met lie 

100 105 110 

Asn Thr Ala His Ala Tyr Gly He Lys Val He Ala Asp lie Val lie 

115 120 125 

Asn His Arg Ala Gly Gly Asp Leu Glu Trp Asn Pro Phe Val Gly Asp 

130 13 5 14 0 

Tyr Thr Trp Thr Asp Phe Ser Lys Val Ala Ser Gly Lys Tyr Thr Ala 
145 150 155 160 

Asn Tyr Leu Asp Phe His Pro Asn Glu Leu His Ala Gly Asp Ser Gly 

165 170 175 

Thr Phe Gly Gly Tyr Pro Asp lie Cys His Asp Lys Ser Trp Asp Gin 

180 185 190 

Tyr Trp Leu Trp Ala Ser Gin Glu Ser Tyr Ala Ala Tyr Leu Arg Ser 

195 200 205 

He Gly lie Asp Ala Trp Arg Phe Asp Tyr Val Lys Gly Tyr Gly Ala 

210 215 220 

Trp Val Val Lys Asp Trp Leu Asn Trp Trp Gly Gly Trp Ala Val Gly 
.225 230 235 240 

Glu Tyr Trp Asp Thr Asn Val Asp Ala Leu Leu Asn Trp Ala Tyr Ser 

245 250 255 

Ser Gly Ala Lys Val Phe Asp Phe Pro Leu Tyr Tyr Lys Met Asp Ala 

260 265 270 

Ala Phe Asp Asn Lys Asn lie Pro Ala Leu Val Glu Ala Leu Lys Asn 

275 280 285 

Gly Gly Thr Val Val Ser Arg Asp Pro Phe Lys Ala Val Thr Phe Val 

290 295 300 

Ala Asn His Asp Thr Asp lie lie Trp Asn Lys Tyr Pro Ala Tyr Ala 
305 310 315 320 
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T . i i 


Thr 


T vr 
32 5 
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Gly 


Gly 


Ser 
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360 


Glu 


Met 
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Arg 
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Gly 
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Gly 


Ser 


Ser 
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Pro 


Lys 


Phe 


Ala 


Gly 


Ala 


Cys 


He 
















Gly 


Trp 


Val 


Asp 
420 


Lys 


Tyr 


Val 


Tyr 


Ala 


Pro 


Ala 
435 


Tyr 


Asp 


Pro 


Ala 


Asn 
440 


Ser 


Tyr 
450 


Cys 


Gly 


Val 


Gly 


Ser 
455 


Glu 



<210> 36 
<211> 718 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 



<400> 36 



Met 


Arg 


Val 


Leu 


Leu 
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Val 


Ala 


Leu 


1 
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Thr 


Ser 


Met 


Glu 
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He 
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Val 
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Gly 
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Phe 
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Tyr 
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Lys 


Asn 
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He 


Trp 


He 


His 


34 5 
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He 


Val 


Tvr 


Tyr 
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Asp 


Ser 


Asp 


Tyr 


Gly 


Ser 
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Gly 
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He 


Lys 


Val 


Gly 
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Arg 


Trp 


Val 


Tyr 


Val 

4 00 


His 


Glu 
4 10 


Tyr 


Thr 


Gly 


Asn 


Leu 
4 15 


Gly 


Ser 


Ser 


Gly 


Trp 


Val 


Tyr 


Leu 


Glu 


425 










430 






Gly 


Gin 


Tyr 


Gly 


Tyr 
4 45 


Ser 


Val 


Trp 


Lys 


Asp 


Glu 


Leu 
460 
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Arg 
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Leu 
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Val 
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Asp 


Asp 
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Val 
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Val 
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Ser 
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650 
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Val 
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Ser 


665 
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Val 
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Val 
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He 
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Lys 
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690 695 700 

lie Arg Gly Lys lie Asn Leu Glu Ser Glu Lys Asp Glu Leu 
7 05 710 715 



<2i0> 37 
<211> 1434 
<212> DNA 

<213> Thermotoga maritima 
<400> 37 

atgaaagaaa ccgctgctgc taaattcgaa cgccagcaca tggacagccc agatctgggt 60 
accctggtgc cacgcggttc catggccgag ttcttcccgg agatcccgaa gatccagttc 120 
gagggcaagg agtccaccaa cccgctcgcc ttccgcttct acgacccgaa cgaggtgatc 180 
gacggcaagc cgctcaagga ccacctcaag ttctccgtgg ccttctggca caccttcgtg 240 
aacgagggcc gcgacccgtt cggcgacccg accgccgagc gcccgtggaa ccgcttctcc 300 
gacccgatgg acaaggcctt cgcccgcgtg gacgccctct tcgagttctg cgagaagctc 360 
aacatcgagt acttctgctt ccacqaccgc gacatcgccc cggagggcaa gaccctccgc 420 
gagaccaaca agatcctcga caaggtggtg gagcgcatca aggagcgcat gaaggactcc 480 
aacgtgaagc tcctctgggg caccgccaac ctcttctccc acccgcgcta catgcacggc 540 
gccgccacca cctgctccgc cgacgtgttc gcctacgccg ccgcccaggt gaagaaggcc 600 
ctggagatca ccaaggagct gggcggcgag ggctacgtgt tctggggcgg ccgcgagggc 660 
tacgagaccc tcctcaacac cgacctcggc ctggagctgg agaacctcgc ccgcttcctc 720 
cgcatggccg tggagtacgc caagaagatc ggcttcaccg gccagttcct catcgagccg 780 
aagccgaagg agccgaccaa gcaccagtac gacttcgacg tggccaccgc ctacgccttc 840 
ctcaagaacc acggcctcga cgagtacttc aagttcaaca tcgaggccaa ccacgccacc 900 
ctcgccggcc acaccttcca gcacgagctg cgcatggccc gcatcctcgg caagctcggc 960 
tccatcgacg ccaaccaggg cgacctcctc ctcggctggg acaccgacca gttcccgacc 1020 
aacatctacg acaccaccct cgccatgtac gaggtgatca aggccggcgg cttcaccaag 1080 
ggcggcctca acttcgacgc caaggtgcgc cgcgcctcct acaaggtgga ggacctcttc 1140 
atcggccaca tcgccggcat ggacaccttc gccctcggct tcaagatcgc ctacaagctc 1200 
gccaaggacg gcgtgttcga caagttcatc gaggagaagt accgctcctt caaggagggc 1260 
atcggcaagg agatcgtgga gggcaagacc gacttcgaga agctggagga gtacatcatc 1320 
gacaaggagg acatcgagct gccgtccggc aagcaggagt acctggagtc cctcctcaac 1380 
tcctacatcg tgaagaccat cgccgagctg cgctccgaga aggacgagct gtga 1434 

<210> 38 
<211> 477 
<212> PRT 

<213> Thermotoga maritima 
<400> 38 



Met 


Lys 


Glu 


Thr 


Ala 


Ala 


Ala 


Lys 


Phe 


Glu 


Arg 


Gin 


His 


Met 


Asp 


Ser 


1 








5 










10 










15 




Pro 


Asp 


Leu 


Gly 
20 


Thr 


Leu 


Val 


Pro 


Arg 
25' 


Gly 


Ser 


Met 


Ala 


Glu 
30 


Phe 


Phe 


Pro 


Glu 


lie 

35 


Pro 


Lys 


lie 


Gin 


Phe 
40 


Glu 


Gly 


Lys 


Glu 


Ser 
45 


Thr 


Asn 


Pro 


Leu 


Ala 
50 


Phe 


Arg 


Phe 


Tyr 


Asp 
55 


Pro 


Asn 


Glu 


Val 


He 
60 


Asp 


Gly 


Lys 


Pro 


Leu 


Lys 


Asp 


His 


Leu 


Lys 


Phe 


Ser 


Val 


Ala 


Phe 


Trp 


His 


Thr 


Phe 


Val 


65 










70 










75 










80 


Asn 


Glu 


Gly 


Arg 


Asp 
85 


Pro 


Phe 


Gly 


Asp 


Pro 
90 


Thr 


Ala 


Glu 


Arg 


Pro 
95 


Trp 


Asn 


Arg 


Phe 


Ser 
100 


Asp 


Pro 


Met 


Asp 


Lys 
105 


Ala 


Phe 


Ala 


Arg 


Val 
110 


Asp 


Ala 


Leu 


Phe 


Glu 
115 


Phe 


Cys 


Glu 


Lys 


Leu 
120 


Asn 


lie 


Glu 


Tyr 


Phe 
125 


Cys 


Phe 


His 
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Asp 


Arg 


Asp 


lie 




130 






lie 


Leu 


Asp 


Lys 


14 5 








Asn 


Val 


Lys 


Leu 


Tyr 


Met 


His 


Gly 








180 


Ala 


Ala 


Ala 


Gin 






195 




Gl y 


Glu 


Gly 


Tyr 




210 






Leu 


Asn 


Thr 


Asp 


225 








Arg 


Met 


Ala 


Val 


Leu 


lie 


Glu 


Pro 








260 


Asp 


Va 1 


Al a 


inr 






275 




Tyr 


Phe 


Lys 


Phe 




290 






Thr 


Phe 


Gin 


His 


305 








Ser 


He 


Asp 


Ala 


Gln 


Phe 


Pro 


Thr 








340 


lie 


Lys 


Mia 








355 




Val 


Arg 


Arg 


Ala 




370 






Ala 


Gly 

-* 


Met 


Asp 


385 








Ala 


Lys 


Asp 


Gly 


Phe 


Lys 


Glu 


Gly 








420 


Glu 


Lys 


Leu 


Glu 






435 




Ser 


Gly 


Lys 


Gin 




4 50 






Lys 


Thr 


He 


Ala 



465 



Ala 


D >- — \ 

r r o 




\j j. y 






135 




Val 


Val 


Glu 


Arg 




150 






Leu 


Trp 


Gly 


Thr 


165 








Ala 


Ala 


Thr 


Thr 


Val 


Lys 


Lys 


Ala 








200 


v a i 


D h a 

r n e 


i rp 


o x y 






215 




Leu 


Gly 


Leu 


Glu 




230 






Glu 


Tyr 


Ala 


Lys 


24 5 








Lys 


Pro 


Lys 


Glu 


Ala 


Tyr 


Ala 


Phe 






280 


MS n 


T 1 o 

lie 


o 1 U 








295 




Glu 


Leu 


Arg 


Met 




310 






Asn 


Gin 


Gly 


Asp 


325 








Asn 


He 


Tyr 


Asp 


Gly 


Phe 


Thr 


Lys 








360 


Q o r* 


T \7 T 

i y l 




Va 1 






375 




Thr 


Phe 


Ala 


Leu 




390 






Val 


Phe 


Asp 


Lys 


405 








He 


Gly 


Lys 


Glu 


Glu 


Tyr 


He 


He 






440 


Glu 


Tyr 


Leu 


Glu 






455 




Glu 


Leu 


Arg 


Ser 




470 







Lys 


Thr 


Leu 


Arg 








140 


T 1 d 
lit: 


T \ / c 
Ly o 


r,i n 
ui u 


r\ L. VJ 






155 




Ala 


Asn 


Leu 


Phe 




170 






Cys 


Ser 


Ala 


Asp 


185 








Leu 


Glu 


He 


Thr 


Gly 


Arg 


Glu 


Gly 








220 


Li ti U 


O -L U 


Moll 








235 




Lys 


He 


Gly 


Phe 




250 






Pro 


Thr 


Lys 


His 


265 








Leu 


Lys 


Asn 


His 


Asn 


His 


Ala 


Thr 








300 


Mid 


nl y 




T.p 11 

J— J ^ n 






315 




Leu 


Leu 


Leu 


Gly 




330 






Thr 


Thr 


Leu 


Ala 


345 








Gly 


Gly 


Leu 


Asn 


Glu 


Asp 


Leu 


Phe 








380 




Phe 


L vs 


He 






395 




Phe 


He 


Glu 


Glu 




4 10 






He 


Val 


Glu 


Gly 


425 








Asp 


Lys 


Glu 


Asp 


Ser 


Leu 


Leu 


Asn 








460 


Glu 


Lys 


Asp 


Glu 






475 





Glu 


Thr 


Asn 


Lys 


Met 


Lys 


Asp 


Ser 








160 




His 


Pro 


Arg 






175 




Val 


Phe 


Ala 


Tyr 




190 






Lys 


Glu 


Leu 


Gly 


205 








Tyr 


Glu 


Thr 


Leu 


Ala 


Arg 


Phe 


Leu 








240 


X 1 1 J- 


Gl v 


Gin 


Phe 






255 




Gin 


Tyr 


Asp 


Phe 




270 






Gly 


Leu 


Asp 


Glu 


285 








Leu 


Ala 


Gly 


His 


Gly 


Lys 


Leu 


Gly 








320 


T r o 


As D 


Thr 


Asp 






335 




Met 


Tyr 


Glu 


Val 




350 






Phe 


Asp 


Ala 


Lys 


365 








He 


Gly 


His 


lie 


Ala 


Tyr 


Lys 


Leu 








400 


Lys 


Tyr 


Arg 


Ser 






415 




Lys 


Thr 


Asp 


Phe 




430 






lie 


Glu 


Leu 


Pro 


445 








Ser 


Tyr 


lie 


Val 



Leu 



<210> 39 
< 2 1 1 > 1434 
<212> DNA 

<213> Thermotoga neapolitana 

tggacagccc agatctgggt 60 
agatcccgaa ggtgcagttc 120 
acgacccgga ggagatcatc 180 
ccttctggca caccttcgtg 240 
gcccgtggaa ccgctacacc 300 
tcgagttctg cgagaagctc 360 



<400> 39 
a t gaaagaaa 
accct ggtgc 
gagggcaagg 
gacggcaagc 
aacgagggcc 
gacccga tgg 



ccgctgctgc 
cacgcggtt c 
agtccaccaa 
cgct caagga 
gcgacccgtt 
acaaggcct t 



taaa 1 1 cgaa 
ca t ggccgag 
cccgct cgcc 
ccacct caag 
cggcgacccg 
cgcccgcgtg 



cgccagcaca 
t tct tcccgg 
t tcaagt tct 
t tctccgtgg 
accgccgacc 
gacgccctct 
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aacatcgagt acttctgctt ccacgaccgc gacatcgccc cggagggcaa gaccctccgc 420 
gagaccaaca agatcctcga caaggtggtg gagcgcatca aggagcgcat gaaggactcc 48u 



aacgtgaagc tcctctgggg caccgccaac ctcttctccc acccgcgcta catgcacggc 540 
qccaccacca cctgctccgc cgacgtgttc gcctacgccg ccgcccaggt gaagaaggcc 600 
ctggagatca ccaaggagct gggcggcgag ggctacgtgt tctggggcgg ccgcgagggc 660 



tacgagaccc tcctcaacac cgacctcggc ttcgagctgg agaacctcgc ccgcttcctc 720 

840 
900 

ctcgccggcc acaccttcca gcacgagctg cgcatggccc gcatcctcgg caagctcggc 960 



cgcatggccg tggactacgc caagcgcatc ggcttcaccg ^cagttcct catcgagccg 

900 
960 

tccatcgacg ccaaccaggg cgacctcctc ctcggctggg acaccgacca gttcccgacc 1020 

- ~ ■ - - — — *- *~ ■"" "■ • - 1 0 o 0 



cgcatggccg iqydt idty^ Laay^you ^ ^ ^^w^^-w^*-, ^ ^ 

aagccgaagg agccgaccaa gcaccagtac gacttcgacg tggccaccgc ctacgccttc 

ctcaagtccc acggcctcga cgagtacttc aagttcaaca tcgaggccaa ccacgccacc 900 



aacgtgtacg acaccaccct cgccatgtac gaggtgatca aggccggcgg cttcaccaag 

ggcggcctca acttcgacgc caaggtgcgc cgcgcctcct acaaggtgga ggacctcttc 1140 

atcggccaca tcgccggcat ggacaccttc gccctcggct tcaaggtggc ctacaagctc 1200 

gtgaaggacg gcgtgctcga caagttcatc gaggagaagt accgctcctt ccgcgagggc 1260 

atcggccgcg acatcgtgga gggcaaggtg gacttcgaga agctggagga gtacatcatc 1320 

gacaaggaga ccatcgagct gccgtccggc aagcaggagt acctggagtc cctcatcaac 1380 

tcctacatcg tgaagaccat cctggagctg cgctccgaga aggacgagct gtga 14J4 



<^-10> 40 



- o 

<211> 4 77 
<212> PRT 

<213> Thermotoga neapolitana 



<400> 40 



Met 


Lys 


Glu 


Thr 


1 

Pro 


Asp 


Leu 


Gly 








20 


Pro 


Glu 


lie 


Pro 






35 




Leu 


Ala 


Phe 


Lys 




50 






Leu 


Lys 


Asp 


His 


65 








Asn 


Glu 


Gly 


Arg 


Asn 


Arg 


Tyr 


Thr 








100 


Leu 


Phe 


Glu 


Phe 






115 




Asp 


Arg 


Asp 


lie 




130 






lie 


Leu 


Asp 


Lys 


14 5 








Asn 


Vai 


Lys 


Leu 


Tyr 


Met 


His 


Gly 






180 


Ala 


Ala 


Ala 


Gin 






195 




Gly 


Glu 


Gly 


Tyr 




210 






Leu 


Asn 


Thr 


Asp 


225 








Arg 


Met 


Ala 


Vai 


Leu 


lie 


Glu 


Pro 



Ala 


Ala 


Ala 


Lys 


5 








Thr 


Leu 


Vai 


Pro 


Lys 


Vai 


Gin 


Phe 






40 


Phe 


Tyr 


Asp 


Pro 






55 




Leu 


Lys 


Phe 


Ser 




70 






Asp 


Pro 


Phe 


Gly 


85 








Asp 


Pro 


Met 


Asp 


Cys 


Glu 


Lys 


Leu 








120 


Ala 


Pro 


Glu 


Gly 






135 




Vai 


Vai 


Glu 


Arg 




150 






Leu 


Trp 


Gly 


Thr 


165 








Ala 


Ala 


Thr 


Thr 


Vai 


Lys 


Lys 


Ala 






200 


Vai 


Phe 


Trp 


Gly 






215 




Leu 


Gly 


Phe 


Glu 




230 






Asp 


Tyr 


Ala 


Lys 


245 








Lys 


Pro 


Lys 


Glu 



Phe Glu Arg Gin 
10 

Arg Gly Ser Met 
25 

Glu Gly Lys Glu 

Glu Glu He He 

60 

Vai Ala Phe Trp 
75 

Asp Pro Thr Ala 

90 

Lys Ala Phe Ala 
105 

Asn lie Glu Tyr 

Lys Thr Leu Arg 

140 

He Lys Glu Arg 
155 

Ala Asn Leu Phe 
170 

Cys Ser Ala Asp 
185 

Leu Glu He Thr 

Gly Arg Glu Gly 

220 

Leu Glu Asn Leu 
235 

Arg He Gly Phe 

250 

Pro Thr Lys His 



His 


Met 


Asp 


Ser 






15 




Ala 


Glu 


Phe 


Phe 




30 






Ser 


Thr 


Asn 


Pro 


45 








Asp 


Gly 


Lys 


Pro 


His 


Thr 


Phe 


Vai 








80 


Asp 


Arg 


Pro 


Trp 






95 




Arg 


Vai 


Asp 


Ala 




110 






Phe 


Cys 


Phe 


His 


125 








Glu 


Thr 


Asn 


Lys 


Met 


Lys 


Asp 


Ser 








160 


Ser 


His 


Pro 


Arg 






175 




Vai 


Phe 


Ala 


Tyr 




190 






Lys 


Glu 


Leu 


Gly 


205 








Tyr 


Glu 


Thr 


Leu 


Ala 


Arg 


Phe 


Leu 








240 


Thr 


Gly 


Gin 


Phe 






255 




Gin 


Tyr 


Asp 


Phe 
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260 

Asp Val Ala Thr Ala 
275 

Tyr Phe Lys Phe Asn 
290 

Thr Phe Gin His Glu 

305 

Ser He Asp Ala Asn 

325 

Gin Phe Pro Thr Asn 

340 

He Lys Ala Gly Gly 
355 

Val Arg Arg Ala Ser 

370 

Ala Gly Met Asp Thr 
385 

Val Lys Asp Gly Val 

4 05 

Phe Arg Glu Gly He 

420 

Glu Lys Leu Glu Glu 
4 35 

Ser Gly Lys Gin Glu 
4 50 

Lys Thr lie Leu Glu 
465 









265 






Tyr 


Ala 


Phe 


Leu 


Lys 


Ser 




280 








He 


Glu 
295 


Ala 


Asn 


His 


Ala 


Leu 


Arg 


Met 


Ala 


Arg 


lie 


310 










315 


Gin 


Gly 


Asp 


Leu 


Leu 


Leu 








330 




Val 


Tyr 


Asp 


Thr 

345 


Thr 


Leu 


Phe 


Thr 


Lys 
360 


Gly 


Gly 


Leu 


Tyr 


Lys 
375 


Val 


Glu 


Asp 


Leu 


Phe 


Ala 


Leu 


Gly 


Phe 


Lys 


390 










395 


Leu 


Asp 


Lys 


Phe 


He 
4 10 


Glu 


Gly 


Arg 


Asp 


He 
425 


Val 


Glu 


Tyr 


He 


He 
440 


Asp 


Lys 


Glu 


Tyr 


Leu 


Glu 


Ser 


Leu 


He 


455 










Leu 


Arg 


Ser 


Glu 


Lys 


Asp 


470 










475 



270 



His 


Gly 
285 


Leu 


Asp 


Glu 


Thr 


Leu 


Ala 


Gly 


His 


300 










Leu 


Gly 


Lys 


Leu 


Gly 
320 


Gly 


Trp 


Asp 


Thr 
335 


Asp 


Ala 


Met 


Tyr 
350 


Glu 


Val 


Asn 


Phe 
365 


Asp 


Ala 


Lys 


Phe 


He 


Gly 


His 


He 


380 










Val 


Ala 


Tyr 


Lys 


Leu 
400 


Glu 


Lys 


Tyr 


Arg 
415 


Ser 


Gly 


Lys 


Val 
430 


Asp 


Phe 


Thr 


He 
445 


Glu 


Leu 


Pro 


Asn 


Ser 


Tyr 


He 


Val 



4 60 

Glu Leu 



<210> 41 

<211> 1435 

<212> DNA 

<213> Thermotoga maritima 



tacgacccga acgaggtgat cgacggcaag ccgctcaagg accacctcaa gttctccgtg 
gccttctggc acaccttcgt gaacgagggc cgcgacccgt tcggcgaccc gaccgccgag 300 
cgcccgtgga accgcttctc cgacccgatg gacaaggcct tcgcccgcgt ggacgccctc 360 



<400> 41 

atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60 

atggctagca tgactggtgg acagcaaatg ggtcggatcc ccatggccga gttcttcccg 120 

gagatcccga agatccagtt cgagggcaag gagtccacca acccgctcgc cttccgcttc 180 

ttcgagttct gcgagaagct caacatcgag tacttctgct tccacgaccg cgacatcccc 420 

cggagggcaa gaccctccgc gagaccaaca agatcctcga caaggtggtg gagcgcatca 480 

aggagcgcat gaaggactcc aacgtgaagc tcctctgggg caccgccaac ctcttctccc 540 

acccgcgcta catgcacggc gccgccacca cctgctccgc cgacgtgttc gcctacgccg 600 

ccgcccaggt gaagaaggcc ctggagatca ccaaggagct gggcggcgag ggctacgtgt 660 

tctggggcgg ccgcgagggc tacgagaccc tcctcaacac cgacctcggc ctggagctgg 720 

agaacctccc ccgcttcctc cgcatggccg tggagtacgc caagaagatc ggcttcaccg 780 

gccagttcct catcgagccg aagccgaagg agccgaccaa gcaccagtac gcttcgacgt 840 



ggccaccgcc tacgccttcc tcaagaacca cggcctcgac gagtacttca agttcaacat 900 
cgaggccaac cacgccaccc tcgccggcca caccttccag cacgagctgc gcatggcccg 
catcctcggc aagctcggct ccatcgacgc caaccagggc gacctcctcc tcggctggga 1020 



960 



caccgaccag ttcccgacca acatctacga caccaccctc gccatgtacg aggtgatcaa 1080 

ggccggcggc ttcaccaagg gcggcctcaa cttcgacgcc aaggtgcgcc gcgcctccta 1140 

caaggtggag gacctcttca tcggccacat cgccggcatg gacaccttcg ccctcggctt 1200 

caagatcgcc tacaagctcg ccaaggacgg cgtgttcgac aagttcatcg aggagaagta 1260 

ccgctccttc aaggagggca tcggcaagga gatcgtggag ggcaagaccg acttcgagaa 1320 

gctggaggag tacatcatcg acaaggagga catcgagctg ccgtccggca agcaggagta 1380 
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cctggagtcc ctcctcaact cctacatcgt gaagaccatc gccgagctgc gctga 



1435 



<210> 42 

< 2 1 1 > 4 73 

<212> PRT 

<213> Thermotoga maritima 



<400> 42 



Met 


Gly 


Ser 


Ser 


1 








Arg 


Gly 


Ser 


His 








20 


He 


Pro 


Met 


Ala 






35 




Gly 


Lys 


Glu 


Ser 




50 






Glu 


Val 


He 


Asp 


65 








Ala 


Phe 


Trp 


His 


Pro 


Thr 


TV 1 

Al a 


Glu 








100 


Ala 


Phe 


Ala 


Arg 






115 




He 


Glu 


Tyr 


Phe 




130 






Thr 


Leu 


Arg 


Glu 


14 5 








Lys 


Glu 


TV j-ji 

Arg 


Me t 


Asn 


Leu 


1 

Phe 


Ser 








18 0 


Ser 


Ala 


Asp 


Val 






195 




Glu 


He 


Thr 


Lys 




210 






Arg 


Glu 


Gly 


Tyr 


225 








GlU 


Asn 


Le u 


Mia 


lie 


Gly 


rne 


i n r 








260 


Thr 


Lys 


His 


Gin 






275 




Lys 


Asn 


His 


Gly 




290 






His 


Ala 


Thr 


Leu 


305 








Arg 


He 


Leu 


Gly 


Leu 


Leu 


Gly 


Trp 








340 


Thr 


Leu 


Ala 


Met 






355 




Gly 


Leu 


Asn 


Phe 




370 






Asp 


Leu 


Phe 


He 



385 



His 


His 


His 


HIS 


5 








Met 


Ala 


Ser 


Met 


Glu 


rne 


D K £2> 

rne 


r i O 










Thr 


Asn 


Pro 


Leu 






55 




Gly 


Lys 


Pro 


Leu 




7 0 






Thr 


Phe 


val 


7\ c n 

as n 


8 5 








Arg 


Pro 


Trp 


as n 


va 1 


ASp 


a i a 










i 9 n 


Cys 


Phe 


His 


Asp 




135 




Thr 


as n 


1j y ±> 


T 1 ^ 
lie 




IjU 






Lys 


ASp 


Q o r - 
OCX 


Moll 


16b 








HIS 


D v- r-\ 

fro 


Zi y~ rt 

ai y 


T \; t 

i y r 


r ne 


A i a 


i yr 


A 1 A 










Glu 


Leu 


Gly 


Gly 






215 




Glu 


1 n r 


Leu 


LcU 




OTA 








php 

r 11 c 




Arn 


2 4b 








C 1 \/ 
o ± y 


G 1 n 


Phe 


Leu 


Tyr 


Asp 


Phe 


Asp 






280 


Leu 


Asp 


Glu 


Tyr 






295 




Ala 


Gly 


His 


Thr 




310 






Lys 


Leu 


Gly 


Ser 


325 








Asp 


Thr 


Asp 


Gin 


Tyr 


Glu 


Val 


lie 






360 


Asp 


Ala 


Lys 


Val 






375 




Gly 


His 


He 


Ala 



390 



His His Ser Ser 
10 

Thr Gly Gly Gin 

25 

Glu lie Pro Lys 

Ala Phe Arg Phe 

60 

Lys Asp His Leu 
75 

Glu Gly Arg Asp 
90 

Arg Phe Ser Asp 
105 

Phe Glu Phe Cys 

Arg Asp lie Ala 

140 

Leu Asp Lys Val 
155 

Val Lys Leu Leu 
170 

Met His Gly Ala 
185 

Ala Ala Gin Val 

Glu Gly Tyr Val 

220 

Asn Thr Asp Leu 
235 

Met Ala Val Glu 
250 

lie Glu Pro Lys 

265 

Val Ala Thr Ala 

Phe Lys Phe Asn 

300 

Phe Gin His Glu 
315 

He Asp Ala Asn 
330 

Phe Pro Thr Asn 
345 

Lys Ala Gly Gly 

Arg Arg Ala Ser 

380 

Gly Met Asp Thr 
395 



Gly 


Leu 


Val 


Pro 






15 




O ± 1 1 


1 J C. L, 




Ar a 




n 






He 

-i- -L. V—» 


Gin 


Phe 


Glu 










T v r 


As D 


Pro 


Asn 


Lys 


Phe 


Ser 


Val 






80 


Pro 


Phe 


Gly 


Asp 






95 




Pro 


Met 


Asp 


Lys 




110 






Glu 


Lys 


Leu 


Asn 


125 








Pro 


Glu 


Gly 


Lys 


Va 1 


Glu 


Arg 


He 








160 


Trp 


Gly 


Thr 


Ala 






175 




Ala 


Th r 


Thr 


Cys 




190 






Lys 


Lys 


Ala 


Leu 


205 








Phe 


Trp 


Gly 


Gly 

ml 


Gly 


Leu 


Glu 


Leu 








240 


Tyr 


Ala 


Lys 


Lys 






2 5 5 




Pro 


Lys 


Glu 


Pro 




270 






Tyr 


Ala 


Phe 


Leu 


285 








lie 


Glu 


Ala 


Asn 


Leu 


Arg 


Met 


Ala 








320 


Gin 


Gly 


Asp 


Leu 






335 




lie 


Tyr 


Asp 


Thr 




350 






Phe 


Thr 


Lys 


Gly 


365 








Tyr 


Lys 


Val 


Glu 


Phe 


Ala 


Leu 


Gly 



400 
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Phe Lys He Ala Tyr Lys Leu Ala 

405 

lie Glu Glu Lys Tyr Arg Ser Phe 

420 

Val Glu Gly Lys Thr Asp Phe Glu 
435 440 
Lys Glu Asp lie Glu Leu Pro Ser 

450 455 
Leu Leu Asn Ser Tyr He Val Lys 
465 470 



Lys 


Asp 


Gly 


Val 


Phe 


Asp 


Lys 


• Phe 


410 










4 15 




Lys 


Glu 


Gly 


He 


Gly 


Lys 


Glu 


He 


425 










430 






Lys 


Leu 


Glu 


Glu 


Tyr 


He 


He 


Asp 








44 5 








Gly 


Lys 


Gin 


Glu 
460 


Tyr 


Leu 


Glu 


Ser 


Thr 


He 


Ala 
475 


Glu 


Leu 


Arg 







<210> 43 
<211> 1436 
<212> DNA 

<213> Thermotoga neapolitana 
<400> 43 

atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat 60 

atggctagca tgactggtgg acagcaaatg ggtcggatcc ccatggccga gttcttcccg 120 

gagatcccga aggtgcagtt cgagggcaag gagtccacca acccgctcgc cttcaagttc 180 

tacgacccgg aggagatcat cgacggcaag ccgctcaagg accacctcaa gttctccgtg 240 

gccttctggc acaccttcgt gaacgagggc cgcgacccgt tcggcgaccc gaccgccgac 300 

cgcccgtgga accgctacac cgacccgatg gacaaggcct tcgcccgcgt ggacgccctc 360 

ttcgagttct gcgagaagct caacatcgag tacttctgct tccacgaccg cgacatcccc 420 

cggagggcaa gaccctccgc gagaccaaca agatcctcga caaggtggtg gagcgcatca 480 

aggagcgcat gaaggactcc aacgtgaagc tcctctgggg caccgccaac ctcttctccc 540 

acccgcgcta catgcacggc gccgccacca cctgctccgc cgacgtgttc gcctacgccg 600 

ccgcccaggt gaagaaggcc ctggagatca ccaaggagct gggcggcgag ggctacgtgt 660 

Uctggggcgg ccgcgagggc tacgagaccc tcctcaacac cgacctcggc ttcgagctgg 720 

agaacctcgc ccgcttcctc cgcatggccg tggactacgc caagcgcatc ggcttcaccg 780 

gccagttcct catcgagccg aagccgaagg agccgaccaa gcaccagtac gacttcgacg 840 

tggccaccgc ctacgccttc ctcaagtccc acggcctcga cgagtacttc aagttcaaca 900 

tcgaggccaa ccacgccacc ctcgccggcc acaccttcca gcacgagctg cgcatggccc 960 

gcatcctcgg caagctcggc tccatcgacg ccaaccaggg cgacctcctc ctcggctggg 1020 

acaccgacca gttcccgacc aacgtgtacg acaccaccct cgccatgtac gaggtgatca 1080 

aggccggcgg cttcaccaag ggcggcctca acttcgacgc caaggtgcgc cgcgcctcct 1140 

acaaggtgga ggacctcttc atcggccaca tcgccggcat ggacaccttc gccctcggct 1200 
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gccaccccgg ccgactggcg ctcccagtcc atctacttcc tcctcaccga ccgcttcgcc 
cgcaccgacg gctccaccac cgccacctgc aacaccgccg accagaagta ctgcggcggc 
acctggcagg gcatcatcga caagctcgac tacatccagg gcatgggctt caccgccatc 
tggatcaccc cggtgaccgc ccagctcccg cagaccaccg cctacggcga cgcctaccac 240 
ggctactggc agcaggacat ctactccctc aacgagaact acggcaccgc cgacgacctc 300 
aaggccctct cctccgccct ccacgagcgc ggcatgtacc tcatggtgga cgtggtggcc 360 
aaccacatgg gctacgacgg cgccggctcc tccgtggact actccgtgtt caagccgttc 420 
tcctcccagg actacttcca cccgttctgc ttcatccaga actacgagga ccagacccag 480 
gtggaggact gctggctcgg cgacaacacc gtgtccctcc cggacctcga caccaccaag 540 
gacgtggtga agaacgagtg gtacgactgg gtgggctccc tcgtgtccaa ctactccatc 600 
gacggcctcc gcatcgacac cgtgaagcac gtgcagaagg acttctggcc gggctacaac 660 
aaggccgccg gcgtgtactg catcggcgag gtgctcgacg tggacccggc ctacacctgc 720 
ccgtaccaga acgtgatgga cggcgtgctc aactacccga tctactaccc gctcctcaac 780 
gccttcaagt ccacctccgg ctcgatggac gacctctaca acatgatcaa caccgtgaag 840 
tccgactgcc cggactccac cctcctcggc accttcgUgg agaaccacga caacccgcgc 900 
ttcgcctcct acaccaacga catcgccctc gccaagaacg tggccgcctt catcatcctc 960 
aacgacggca tcccgatcat ctacgccggc caggagcagc actacgccgg cggcaacgac 1020 
ccggccaacc gcgaggccac ctggctctcc ggctacccga ccgactccga gctgtacaag 1080 
ctcatcgcct ccgccaacgc catccgcaac tacgccatct ccaaggacac cggcttcgtg 1140 
acctacaaga actggccgat ctacaaggac gacaccacca tcgccatgcg caagggcacc 1200 
gacggctccc agatcgtgac catcctctcc aacaagggcg cctccggcga ctcctacacc 1260 
ctctccctct ccggcgccgg ctacaccgcc ggccagcagc tcaccgaggt gatcggctgc 1320 
accaccgtga ccgtgggctc cgacggcaac gtgccggtgc cgatggccgg cggcctcccg 1380 
cgcgtgctct acccgaccga gaagctcgcc ggctccaaga tatgctcctc ctccaagccg 1440 
gccaccctcg actcctggct ctccaacgag gccaccgtgg cccgcaccgc catcctcaac 1500 
aacatcggcg ccgacggcgc ctgggtgtcc ggcgccgact ccggcatcgt ggtggcctcc 1560 
ccgtccaccg acaacccgga ctacttctac acctggaccc gcgactccgg catcgtgctc 1620 
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aagaccctcg tggacctctt ccgcaacggc gacaccgacc tcctctccac catcgagcac 1680 

tacatctcct cccaggccat catccagggc gtgtccaacc cgtccggcga cctctcctcc 1740 

ggcggcctcg gcgagccgaa gttcaacgtg gacgagaccg cctacgccgg ctcctggggc 1800 

cgcccgcagc gcgacggccc ggccctccgc gccaccgcca tgatcggctt cggccagtgg 1860 

ctcctcgaca acggctacac ctccgccgcc accgagatcg tgtggccgct cgtgcgcaac 1920 

gacctctcct acgtggccca gtactggaac cagaccggct acgacctctg ggaggaggtg 1980 

aacggctcct ccttcttcac catcgccgtg cagcaccgcg ccctcgtgga gggctccgcc 2040 

ttcgccaccg ccgtgggctc ctcctgctcc tggtgcgact cccaqqcccc gcagatcctc 2100 

tgctacctcc agtccttctg gaccggctcc tacatcctcg ccaacttcga ctcctcccgc 2160 

tccggcaagg acaccaacac cctcctcggc tccatccaca ccttcgaccc ggaggccggc 2220 

tgcgacgact ccaccttcca gccgtgctcc ccgcgcgccc tcgccaacca caaggaggtg 2280 

gtggactcct tccgctccat ctacaccctc aacgacggcc tctccgactc cgaggccgtg 2340 

gccgtgggcc gctacccgga ggactcctac tacaacggca acccgtggtt cctctgcacc 2400 

ctcgccgccg ccgagcagct ctacgacgcc ctctaccagt gggacaagca gggctccctg 2460 

gagatcaccg acgtgtccct cgacttcttc aaggccctct actccggcgc cgccaccggc 2520 

acctactcct cctcctcctc cacctactcc tccatcgtgt ccgccgtgaa gaccttcgcc 2580 

gacggcttcg tgtccatcgt ggagacccac gccgcctcca acggctccct ctccgagcag 2640 

ttcgacaagt. ccgacggcga cgagctgtcc gcccgcgacc tcacctggtc ctacgccgcc 2700 

ctcctcaccg ccaacaaccg ccgcaactcc gtggtgccgc cgtcctgggg cgagacctcc 2760 

gcctcctccg tgccgggcac ctgcgccgcc acctccgcct ccggcaccta ctcctccgtg 2820 

accgtgacct cctggccgtc catcgtggcc accggcggca ccaccaccac cqccaccacc 2880 

accggctccg gcggcgtgac ctccacctcc aagaccacca ccaccgcctc caagacctcc 2940 

accaccacct cctccacctc ctgcaccacc ccgaccgccg tggccgtgac cttcgacctc 3000 

accgccacca ccacctacgg cgagaacatc tacctcgtgg gctccatctc ccagctcggc 3060 

gactgggaga cctccgacgg catcgccctc tccgccgaca agtacacctc ctccaacccg 3120 

ccgtggtacg tgaccgtgac cctcccqgcc ggcgagtcct tcgagtacaa gttcatccgc 3180 

gtggagtccg acgactccgt ggagtgggag tccgacccga accgcgagta caccgtgccg 3240 

caggcctgcg gcgagtccac cgccaccgtg accgacacct ggcgc 3285 

<210> 47 
<211> 679 
<212> PRT 

<213> Thermoanaerobacterium thermosaccharolyticum 
<400> 47 

Val Leu Ser Gly Cys Ser Asn Asn Val Ser Ser He Lys He Asp Arg 

15 10 15 

Phe Asn Asn He Ser Ala Val Asn Gly Pro Gly Glu Glu Asp Thr Trp 

20 25 30 

Ala Ser Ala Gin Lys Gin Gly Val Gly Thr Ala Asn Asn Tyr Val Ser 

35 40 45 

Arg Val Trp Phe Thr Leu Ala Asn Gly Ala He Ser Glu Val Tyr Tyr 

50 55 60 

Pro Thr He Asp Thr Ala Asp Val Lys Glu He Lys Phe He Val Thr 
65 70 75 80 

Asp Gly Lvs Ser Phe Val Ser Asp Glu Thr Lys Asp Ala He Ser Lys 

85 90 95 

Val Glu Lys Phe Thr Asp Lys Ser Leu Gly Tyr Lys Leu Val Asn Thr 

100 105 110 

Asp Lys Lys Gly Arg Tyr Arg He Thr Lys Glu lie Phe Thr Asp Val 

115 120 125 

Lys Arg Asn Ser Leu He Met Lys Ala Lys Phe Glu Ala Leu Glu Gly 

130 135 140 

Ser He His Asp Tyr Lys Leu Tyr Leu Ala Tyr Asp Pro His He Lys 
145 150 155 160 

Asn Gin Gly Ser Tyr Asn Glu Gly Tyr Val lie Lys Ala Asn Asn Asn 

165 170 175 

Glu Met Leu Met Ala Lys Arg Asp Asn Val Tyr Thr Ala Leu Ser Ser 
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Trp 


Thr 


Lys 




275 










280 






Asn 


Phe 


Asn 


Gly 


Lys 


Ala 


Asn 


Ser 


Leu 


Tyr 
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He 
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He 
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He 
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Gly 
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595 










600 






Tyr 


Glu 


He 


Ala 


Ala 


Gly 


Lys 


Asp 


Ala 


Thr 


610 










615 








Glu 
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Tyr 


Cys 


Asn 


Thr 


Leu 


Asn 
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315 
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Asp 
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Arg 


Ser 


Leu 


Asp 


Tyr 


Leu 
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Asp 
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Glu 
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645 650 655 

Ala Glu Tyr Val lie Leu Phe Ala Ser Asn lie Glu His Lys Val Leu 

660 665 670 

Asp Met Pro Asp lie Val Tyr 
675 



<210> 48 
<211> 2037 
<212> DNA 

<213> Thermoanaerobacter ium t hermosaccha rol yt icum 

< 2 2 C> 

<223> synthetic 
<400> 48 

gtgctctccg gctgctccaa caacgtgtcc tccatcaaga tcgaccgctt caacaacatc 60 

tccgccgtga acggcccggg cgaggaggac acctgggcct ccgcccagaa gcagggcgtg 120 

ggcaccgcca acaactacgt gtcccgcgtg tggttcaccc tcgccaacgg cgccatctcc 180 

gaggtgtact acccgaccat cgacaccgcc gacgtgaagg agatcaagtt catcgtgacc 240 

gacggcaagt ccttcgtgtc cgacgagacc aaggacgcca tctccaaggt ggagaagttc 300 

accgacaagt ccctcggcta caagctcgtg aacaccgaca agaagggccg ctaccgcatc 360 

accaaggaaa tcttcaccga cgtgaagcgc aactccctca tcatgaaggc caagttcgag 420 

gccctcgagg gctccatcca cgactacaag ctctacctcg cctacgaccc gcacatcaag 480 

aaccagggct cctacaacga gggctacgtg atcaaggcca acaacaacga gatgctcatg 540 

gccaagcgcg acaacgtgta caccgccctc tcctccaaca tcggctggaa gggctactcc 600 

atcggctact acaaggtgaa cgacatcatg accgacctcg acgagaacaa gcagatgacc 660 

aagcactacg actccgcccg cggcaacatc atcgagggcg ccgagatcga cctcaccaag 720 

aactccgagt tcgagatcgt gctctccttc ggcggctccg actccgaggc cgccaagacc 780 

gccctcgaga ccctcggcga ggactacaac aacctcaaga acaactacat cgacgagtgg 840 

accaagtact gcaacaccct caacaacttc aacggcaagg ccaactccct ctactacaac 900 

tccatgatga tcctcaaggc ctccgaggac aagaccaaca agggcgccta catcgcctcc 960 

ctctccatcc cgtggggcga cggccagcgc gacgacaaca ccggcggcta ccacctcgtg 1020 

tggtcccgcg acctctacca cgtggccaac gccttcatcg ccgccggcga cgtggactcc 1080 

gccaaccgct ccctcgacta cctcgccaag gtggtgaagg acaacggcat gatcccgcag 1140 

aacacctgga tctccggcaa gccgtactgg acctccatcc agctcgacga gcaggccgac 1200 

ccgatcatcc tctcctaccg cctcaagcgc tacgacctct acgactccct cgtgaagccg 1260 

ctcgccgact tcatcatcaa gatcggcccg aagaccggcc aggagcgctg ggaggagatc 1320 

ggcggctact ccccqgccac gatggccgcc gaggtggccg gcctcacctg cgccgcctac 1380 

atcgccgagc agaacaagga ctacgagtcc gcccagaagt accaggagaa ggccgacaac 1440 

tggcagaagc tcatcgacaa cctcacctac accgagaacg gcccgctcgg caacggccag 1500 

tactacatcc gcatcgccgg cctctccgac ccgaacgccg acttcatgat caacatcgcc 1560 

aacggcggcg gcgtgtacga ccagaaggag atcgtggacc cgtccttcct cgagctggtg 1620 

cgcctcggcg tgaagtccgc cgacgacccg aagatcctca acaccctcaa ggtggtggac 1680 

tccaccatca aggtggacac cccgaagggc ccgtcctggt atcgctacaa ccacgacggc 1740 

tacggcgagc cgtccaagac cgagctgtac cacggcgccg gcaagggccg cctctggccg 1800 

ctcctcaccg gcgagcgcgg catgtacgag atcgccgccg gcaaggacgc caccccgtac 1860 

gtgaaggcga tggagaagtt cgccaacgag ggcggcatca tctccgagca ggtgtgggag 1920 

gacaccggcc tcccgaccga ctccgcctcc ccgctcaact gggcccacgc cgagtacgtg 1980 

atcctcttcg cctccaacat cgagcacaag gtgctcgaca tgccggacat cgtgtac 2037 

<210> 49 
<211> 579 
<212> PRT 

<213> Rhizopus oryzae 
<400> 49 

Ala Ser lie Pro Ser Ser Ala Ser Val Gin Leu Asp Ser Tyr Asn Tyr 
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465 470 475 480 

Asn Gly Gly Val Thr Val Ser Ser He Ser Leu Pro Phe Phe Lys Lys 

485 490 495 

Phe Asp Ser Ser Ala Thr Ser Gly Lys Lys Tyr Thr Val Gly Thr Ser 

500 505 510 

Asp Phe Asn Asn Leu Ala Gin Asn He Ala Leu Ala Ala Asp Arg Phe 

515 520 525 

Leu Ser Thr Val Gin Leu His Ala His Asn Asn Gly Ser Leu Ala Glu 

530 535 540 

Glu Phe Asp Arg Thr Thr Gly Leu Ser Thr Gly Ala Arg Asp Leu Thr 
545 550 555 560 

Trp Ser His Ala Ser Leu He Thr Ala Ser Tyr Ala Lys Ala Gly Ala 

565 570 575 

Pro Ala Ala 



<210> 50 
<211> 1737 
<212> DNA 

<213> Rhizopus oryzae 
<400> 50 

gcctccatcc cgtcctccgc ctccgtgcag ctcgactcct acaactacga cggctccacc 60 
ttctccggca aaatctacgt gaagaacatc gcctactcca agaaggtgac cgtgatctac 120 
gccgacggct ccgacaactg gaacaacaac ggcaacacca tcgccgcctc ctactccgcc 180 
ccgatctccg gctccaacta cgagtactgg accttctccg cctccatcaa cggcatcaag 240 
gagttctaca tcaagtacga ggtgtccggc aagacctact acgacaacaa caactccgcc 300 
aactaccagg tgtccacctc caagccgacc accaccaccg ccaccgccac caccaccacc 360 
gccccgtcca cctccaccac caccccgccg tcccgctccg agccggccac cttcccgacc 420 
ggcaactcca ccatctcctc ctggatcaag aagcaggagg gcatctcccg cttcgccatg 480 
ctccgcaaca tcaacccgcc gggctccgcc accggcttca tcgccgcctc cctctccacc 540 
gccggcccgg actactacta cgcctggacc cgcgacgccg ccctcacctc caacgtgatc 600 
gtgtacgagt acaacaccac cctctccggc aacaagacca tcctcaacgt gctcaaggac 660 
tacgtgacct tctccgtgaa gacccagtcc acctccaccg tgtgcaactg cctcggcgag 720 
ccgaagttca acccggacgc ctccggctac accggcgcct ggggccgccc gcagaacgac 780 
ggcccggccg agcgcgccac caccttcatc ctcttcgccg actcctacct cacccagacc 840 
aaggacgcct cctacgtgac cggcaccctc aagccggcca tcttcaagga cctcgactac 900 
gtggtgaacg tgtggtccaa cggctgcttc gacctctggg aggaggtgaa cggcgtgcac 960 
ttctacaccc tcatggtgat gcgcaagggc ctcctcctcg gcgccgactt cgccaagcgc 1020 
aacggcgact ccacccgcgc ctccacctac tcctccaccg cctccaccat cgccaacaaa 1080 
atctcctcct tctgggtgtc ctccaacaac tggatacagg tgtcccagtc cgtgaccggc 1140 
ggcgtgtcca agaagggcct cgacgtgtcc accctcctcg ccgccaacct cggctccgtg 1200 
gacgacggct tcttcacccc gggctccgag aagatcctcg ccaccgccgt ggccgtggag 1260 
gactccttcg cctccctcta cccgatcaac aagaacctcc cgtcctacct cggcaactcc 1320 
atcggccgct acccggagga cacctacaac ggcaacggca actcccaggg caactcctgg 1380 
ttcctcgccg tgaccggcta cgccgagctg tactaccgcg ccatcaagga gtggatcggc 1440 
aacggcggcg tgaccgtgtc ctccatctcc ctcccgttct tcaagaagtt cgactcctcc 1500 
gccacctccg gcaagaagta caccgtgggc acctccgact tcaacaacct cgcccagaac 1560 
atcgccctcg ccgccgaccg cttcctctcc accgtgcagc tccacgccca caacaacggc 1620 
tccctcgccg aggagttcga ccgcaccacc ggcctctcca ccggcgcccg cgacctcacc 1680 
tggtcccacg cctccctcat caccgcctcc tacgccaagg ccggcgcccc ggccgcc 1737 

<210> 51 

<211> 439 

<2i2> PRT 

<213> Artificial Sequence 
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<220> 

<223> synthetic 



<400> 51 



Met 
1 


Ala 


Lys 


His 


Leu 

l—> U* 

5 


Ala 


Ala 


Met 


A. 

Va 1 


T i ) 


J_J ^ 


Cvs 


Leu 


Gl v 


Ser 


Gin 








20 










Gin 


Gl v 


Phe 


As n 


Trp 


Glu 


Ser 


Trp 

tr 
















40 


T vr 




Le u 


Gl y 


Arg 


Val 


Asp 


Asp 




SO 










55 




Val 

V 1 — ■* _1_ 


Trn 


Le u 


Pro 


Gin 


Pro 


Ser 


His 


6 5 








70 






Pro 


Gl v 


A r a 


Le u 


Tvr 


Asp 


Leu 


Asp 










85 








Gin 


T ,p \ l 

l-J \j 




Ser 


Le u 


Thr 


Ala 


Ala 






100 










Va 1 


Ala 


Asd 


Val 


Val 


He 


Asn 


His 






115 










120 


Am 


Gl v 


lie 


Tvr 


Cvs 


Val 


Phe 


Glu 




1 30 










135 




Asp 


Trp 


Gly 


Pro 


Asp 


Met 


He 


Cys 


14 5 










150 






Gl v 


Arg 


Gly 


His 


Arq 


Asp 


Thr 


Gly 










165 








lie 


Asp 


His 


Leu 


Asn 


Pro 


Arg 


Val 








180 










Asn 


Trp 


Leu 


Lys 


Ser 


Asp 


Leu 


Gly 






195 










200 


Ala 


Lys 


Gly 


Tyr 


Ser 


Ala 


Ala 


Val 




210 










215 




Ala 


Pro 


Thr 


Phe 


Val 


Val 


Ala 


Glu 


225 










230 






Gl v 


Asn 


Gly 


Glu 


Pro 


Ser 


Ser 


Asn 










24 5 








Val 


Asn 


Trp 


Ala 


Gin 


Ala 


Val 


Gly 








260 










Thr 


Thr 


Lys 


Gly 


Val 


Leu 


Gin 


Ala 






275 










280 


Met 


Lys 


Asp 


Gly 


Asn 


Gly 


Lys 


Ala 




290 










295 




Glu 


Lys 


Ala 


Val 


Thr 


Phe 


Val 


Asp 


305 








310 






As n 


Ser 


Trp 


Pro 


Phe 


Pro 


Ser 


Asp 










32 5 

—S C~ S 








lie 


Leu 


Thr 


His 


Pro 


Gly 


Thr 


Pro 








340 










Asp 


Trp 


Asn 


Leu 


Lys 


Gin 


Glu 


He 






355 










360 


Arg 


Asn 


Gly 


He 


His 


Pro 


Gly 


Ser 


370 










375 




Gly 


Asp 


Leu 


Tyr 


Val 


Ala 


Lys 


He 


385 










390 






Gly 


Ser 


Arg 


Tyr 


Asp 


Val 


Gly 


Asn 










405 








Val 


Ala 


His 


Gly 


Asn 


Asn 


Tyr 


Cys 








420 











Cys 


Trp 


Cys 


Ser 


Leu 


Leu 


Val 


Leu 




10 










15 




Leu 


Ala 


Gin 


Ser 


Gin 


Val 


Leu 


Phe 


25 










30 






Lys 


Lys 


Gin 


Gly 


Gly 


Trp 


Tyr 


Asn 










45 








He 


Ala 


Ala 


Thr 


Gly 


Ala 


Thr 


His 








60 










Ser 


Val 


Ala 


Pro 


Gin 


Gly 


Tyr 


Met 






75 










80 


Ala 


Ser 


Lys 


Tyr 


Gly 


Thr 


His 


Ala 




90 










95 




Phe 


His 


Ala 


Lys 


Gly 


Val 


Gin 


Cys 


105 










HO 






Arg 


Cys 


Ala 


Asp 


Tyr 


Lys 


Asp 


Gly 










125 








Gly 


Gly 


Thr 


Pro 


Asp 


Ser 


Arg 


Leu 








140 










Ser 


Asp 

L 


Asp 


Thr 


Gin 


Tyr 


Ser 


Asn 






155 










160 


Ala 


Asp 


Phe 


Ala 


Ala 


Ala 


Pro 


Asp 




170 










175 




Gin 


Gin 


Glu 


Leu 


Ser 


Asp 


Trp 


Leu 


185 










190 






Phe 


Asp 


Gly 


Trp 


Arg 


Leu 


Asp 


Phe 










205 








Ala 


Lys 


Val 


Tyr 


Val 


Asp 


Ser 


Thr 








220 










He 


Trp 


Ser 


Ser 


Leu 


His 


Tyr 


Asp 






235 










240 


Gin 


Asp 


Ala 


Asp 


Arg 


Gin 


Glu 


Leu 




250 










255 




Gly 


Pro 


Ala 


Ala 


Ala 


Phe 


Asp 


Phe 


265 










270 






Ala 


Val 


Gin 


Gly 

■4 


Glu 


Leu 


Trp 


Arg 










285 








Pro 


Gly 


Met 


He 


Gly 


Trp 


Leu 


Pro 








300 










Asn 


His 


Asp 


Thr 


Gly 


Ser 


Thr 


Gin 






315 










320 


Lys 


Val 


Met 


Gin 


Gly 


Tyr 


Ala 


Tyr 




330 










335 




Cys 


He 


Phe 


Tyr 


Asp 


His 


Val 


Phe 


345 










350 






Ser 


Ala 


Leu 


Ser 


Ala 


Val 


Arg 


Ser 










365 








Glu 


Leu 


Asn 


He 


Leu 


Ala 


Ala 


Asp 








380 










Asp 


Asp 


Lys 


Val 


lie 


Val 


Lys 


He 






395 










400 


Leu 


He 


Pro 


Ser 


Asp 


Phe 


His 


Ala 




410 










4 15 




Val 


Trp 


Glu 


Lys 


His 


Gly 


Leu 


Arg 


425 










430 
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Val Pro Ala Gly Arg His His 
435 



<210> 52 
< 2 1 1 > 132 0 
<212> DNA 

<213> Artificial Sequence 



??0> 



^ /L cl 

<22 3> synthetic 
<4 00> 52 

atggcgaagc acttggctgc catgtgctgg tgcagcctcc tagtgcttgt actgctctgc 60 

ttgggctccc agctggccca atcccaggtc ctcttccagg ggttcaactg ggagtcgtgg 120 

aagaagcaag gtgggtggta caactacctc ctggggcggg tggacgacat cgccgcgacg 180 

ggggccacgc acgtctggct cccgcagccg tcgcactcgg tggcgccgca ggggtacatg 240 

cccggccggc tctacgacct ggacgcgtcc aagtacggca cccacgcgga gctcaagtcg 300 

ctcaccgcgg cgttccacgc caagggcgtc cagtgcgtcg ccgacgtcgt gatcaaccac 360 

cgctgcgccg actacaagga cggccgcggc atctactgcg tcttcgaggg cggcacgccc 420 

gacagccgcc tcgactgggg ccccgacatg atctgcagcg acgacacgca gtactccaac 480 

gggcgcgggc accgcgacac gggggccgac ttcgccgccg cgcccgacat cgaccacctc 540 

aacccgcgcg tgcagcagga gctctcggac tggctcaact ggctcaagtc cgacctcggc 600 

ttcgacggct ggcgcctcga cttcgccaag ggctactccg ccgccgtcgc caaggtgtac 660 

gtcgacagca ccqcccccac cttcgtcgtc gccgagatat ggagctccct ccactacgac 720 

ggcaacggcg agccgtccag caaccaggac gccgacaggc aggagctggt caactgggcg 780 

caggcggtgg gcggccccgc cgcggcgttc gacttcacca ccaagggcgt gctgcaggcg 840 

gccgtccagg gcgagctgtg gcgcatgaag gacggcaacg gcaaggcgcc cgggatgatc 900 

ggctggctgc cggagaaggc cgtcacgttc gtcgacaacc acgacaccgg ctccacgcag 960 

aactcgtggc cattcccctc cgacaaggtc atgcagggct acgcctatat cctcacgcac 1020 

ccaggaactc catgcatctt ctacgaccac gttttcgact ggaacctgaa gcaggagatc 1080 

agcgcgctgt ctgcggtgag gtcaagaaac gggatccacc cggggagcga gctgaacatc 1140 

ctcgccgccg acggggatct ctacgtcgcc aagattgacg acaaggtcat cgtgaagatc 1200 

gggtcacggt acgacgtcgg gaacctgatc ccctcagact tccacgccgt tgcccctggc 1260 

aacaactact gcgtttggga gaagcacggt ctgagagttc cagcggggcg gcaccactag 1320 



<210> 53 
<211> 45 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> synthetic 
<400> 53 

Ala Thr Gly Gly Thr Thr Thr Thr Ala Thr Thr Thr Gly Ser Gly Gly 

15 10 15 

Val Thr Ser Thr Ser Lys Thr Thr Thr Thr Ala Ser Lys Thr Ser Thr 

20 25 30 

Thr Thr Ser Ser Thr Ser Cys Thr Thr Pro Thr Ala Val 
35 40 45 



<210> 54 
<211> 137 
<212> DNA 
<213> Artificial 



Sequence 
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<220> 

<223> synthetic 
<4 00> 54 

gccaccggcg gcaccaccac caccgccacc 
tccaagacca ccaccaccgc ctccaagacc 
accccgaccg ccgtgtc 



accaccggct ccggcggcgt gacctccacc 60 
tccaccacca cctcctccac ctcctgcacc 120 

137 



<210> 55 
<211> 300 
<212> PRT 
<213> Pyrococcus 



furiosus 



<400> 55 












He 
1 


Tyr 


Phe 


Val 


Glu 
5 


Lys 


Tyr 


His 


Asn 


Thr 


Sei- 


Ser 
20 


Thr 


Pro 


Pro 


Gin 


Leu 


Lys 


Ile 


Arg 


Tyr 


Pro 


Asp 


Asp 






35 










40 


Asp 


Lys 


Asp 


Gly 


Asp 


Gly 


Asn 


Pro 




50 










55 




Trp 


Asn 


lie 


Leu 


Asn 


Ala 


Thr 


Gly 


65 










70 






Thr 


Set- 


Gly 


Val 


Leu 


His 


Tyr 


Val 










85 








Arg 


Asp 


Arg 


Ser 


Asn 


Trp 


Val 


His 








100 










Asn 


Lys 


Pro 


Trp 


Asn 


Ala 


Asn 


Tyr 




115 










120 


Pro 


Ser 


Lys 


Val 


Ser 


Asn 


Leu 


Thr 




130 








135 




Lys 


Leu 


Glu 


Pro 


Lys 


Asn 


Gly 


Leu 


145 










150 






Trp 


Leu 


Thr 


Arg 


Glu 


Ala 


Trp 


Arg 










165 








Gin 


Glu 


Val 


Met 


He 


Trp 


lie 


Tyr 








180 










Ser 


Lys 


Val 


Lys 


Glu 


He 


Val 


Val 




195 










200 


Val 


Asn 


Ala 


Thr 


Phe 


Glu 


Val 


Trp 




210 










215 




Val 


Ala 


Phe 


Arg 


He 


Lys 


Thr 


Pro 


225 










230 






Pro 


Tyr 


Gly 


Ala 


Phe 


lie 


Ser 


Val 








245 








Asn 


Tyr 


Thr 


Glu 


Leu 


Tyr 


Leu 


Glu 






260 










Gly 


Thr 


Pro 


Ser 


Thr 


Thr 


Ser 


Ala 




275 










280 


He 


Thr 


Leu 


Thr 


Pro 


Leu 


Asp 


Arg 




290 










295 





Thr 


Ser 


Glu 


Asp 


Lys 


Ser 


Thr 


Ser 




10 










15 




Thr 


Thr 


Leu 


Ser 


Thr 


Thr 


Lys 


Val 


25 










30 






Gly 


Glu 


Trp 


Pro 


Gly 


Ala 


Pro 


lie 










4 5 








Glu 


Phe 


Tyr 


He 


Glu 


He 


Asn 


Leu 








60 










Phe 


Ala 


Glu 


Met 


Thr 


Tyr 


Asn 


Leu 






75 










80 


Gin 


Gin 


Leu 


Asp 


Asn 


He 


Val 


Leu 




90 










95 




Gly 


Tyr 


Pro 


Glu 


He 


Phe 


Tyr 


Gly 


105 










no 






Ala 


Thr 


Asp 


Gly 


Pro 


lie 


Pro 


Leu 










125 








Asp 


Phe 


Tyr 


Leu 


Thr 


lie 


Ser 


Tyr 








140 










Pro 


He 


Asn 


Phe 


Ala 


lie 


Glu 


Ser 






155 










160 


Thr 


Thr 


Gly 


He 


Asn 


Ser 


Asp 


Glu 




170 










175 




Tyr 


Asp 


Gly 


Leu 


Gin 


Pro 


Ala 


Gly 


185 










190 






Pro 


He 


He 


Val 


Asn 


Gly 


Thr 


Pro 










205 








Lys 


Ala 


Asn 


He 


Gly 


Trp 


Glu 


Tyr 








220 










He 


Lys 


Glu 


Gly 


Thr 


Val 


Thr 


He 






235 










240 


Ala 


Ala 


Asn 


He 


Ser 


Ser 


Leu 


Pro 




250 










255 




Asp 


Val 


Glu 


He 


Gly 


Thr 


Glu 


Phe 


265 










270 






His 


Leu 


Glu 


Trp 


Trp 


lie 


Thr 


Asn 



285 



Pro Leu lie Ser 

300 



<210> 56 
<211> 903 
<212> DNA 
<213> Pyrococcus 



furiosus 
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<400> 56 

atctacttcg tggagaagta ccacacctcc gaggacaagt ccacctccaa cacctcctcc 60 

accccgccgc agaccaccct ctccaccacc aaggtgctca agatccgcta cccggacgac 120 

ggcgagtggc ccggcgcccc gatcgacaag gacggcgacg gcaacccgga gttctacatc 180 

gagatcaacc tctggaacat cctcaacgcc accggcttcg ccgagatgac ctacaacctc 240 

actagtggcg tgctccacta cgtgcagcag ctcgacaaca tcgtgctccg cgaccgctcc 300 

aactgggtgc acggctaccc ggaaatcttc tacggcaaca agccgtggaa cgccaactac 360 

gccaccgacg gcccgatccc gctcccgtcc aaggtgtcca acctcaccqa cttctacctc 420 

accatctcct acaagctcga gccgaagaac ggtctcccga tcaacttcgc catcgagtcc 480- 

tggctcaccc gcgaggcctg gcgcaccacc ggcatcaact ccgacgagca ggaggtgatg 540 

atctggatct actacgacgg cctccagccc gcgggctcca aggtgaagga gatcgtggtg 600 

ccgatcatcg tgaacggcac cccggtgaac gccaccttcg aggtgtggaa ggccaacatc 660 

ggctgggagt acgtggcctt ccgcatcaag accccgatca aggagggcac cgtgaccatc 720 

ccgtacggcg ccttcatctc cgtggccgcc aacatctcct ccctcccgaa ctacaccgaq 780 

aagtacctcg aggacgtgga gatcggcacc gagttcggca ccccgtccac cacctccgcc 840 

cacctcgagt ggtggatcac caacatcacc ctcaccccgc tcgaccgccc gctcatctcc 900 
tag 903 

<210> 57 
<211> 387 
<212> PRT 

<213> Thermus flavus 
<400> 57 



Met 


Tyr 


Glu 


Pro 


Lys 


Pro 


Glu 


His 


Arg 


Phe 


Thr 


Phe 


Gly 


Leu 


Trp 


Thr 


1 






5 










10 










15 




Val 


Asp 


Asn 


Val 


Asp 


Arg 


Asp 


Pro 


Phe 


Gly 


Asp 


Thr 


Val 


Arg 


Glu 


Arg 






20 










25 










30 






Leu 


Asp 


Pro 


Val 


Tyr 


Val 


Val 


His 


Lys 


Leu 


Ala 


Glu 


Leu 


Gly 


Ala 


Tyr 




35 








40 










45 








Gly 


Val 


Asn 


Leu 


His 


Asp 


Glu 


Asp 


Leu 


He 


Pro 


Arg 


Gly 


Thr 


Pro 


Pro 


50 










55 










60 










Gin 


Glu 


Arg 


Asp 


Gin 


He 


Val 


Arg 


Arg 


Phe 


Lys 


Lys 


Ala 


Leu 


Asp 


Glu 


65 






70 










75 










80 


Thr 


Val 


Leu 


Lys 


Val 


Pro 


Met 


Val 


Thr 


Ala 


Asn 


Leu 


Phe 


Ser 


Glu 


Pro 








85 










90 










95 




Ala 


Phe 


Arg 


Asp 


Gly 


Ala 


Ser 


Thr 


Thr 


Arg 


Asp 


Pro 


Trp 


Val 


Trp 


Ala 






100 








105 










110 






Tyr 


Ala 


Leu 


Arg 


Lys 


Ser 


Leu 


Glu 


Thr 


Met 


Asp 


Leu 


Gly 


Ala 


Glu 


Leu 




115 








120 










125 








Gly 


Ala 


Glu 


lie 


Tyr 


Met 


Phe 


Trp 


Met 


Val 


Arg 


Glu 


Arg 


Ser 


Glu 


Val 


130 










135 










140 










Glu 


Ser 


Thr 


Asp 


Lys 


Thr 


Arg 


Lys 


Val 


Trp 


Asp 


Trp 


Val 


Arg 


Glu 


Thr 


145 






150 










155 










160 


Leu 


Asn 


Phe 


Met 


Thr 


Ala 


Tyr 


Thr 


Glu 


Asp 


Gin 


Gly 


Tyr 


Gly 


Tyr 


Arg 








165 








170 










175 




Phe 


Ser 


Val 


Glu 


Pro 


Lys 


Pro 


Asn 


Glu 


Pro 


Arg 


Gly 


Asp 


He 


Tyr 


Phe 








180 








185 










190 






Thr 


Thr 


Val 


Gly 


Ser 


Met 


Leu 


Ala 


Leu 


lie 


His 


Thr 


Leu 


Asp 


Arg 


Pro 






195 








200 










205 








Glu 


Arg 


Phe 


Gly 


Leu 


Asn 


Pro 


Glu 


Phe 


Ala 


His 


Glu 


Thr 


Met 


Ala 


Gly 




210 








215 










220 










Leu 


Asn 


Phe 


Asp 


His 


Ala 


Val 


Ala 


Gin 


Ala 


Val 


Asp 


Ala 


Gly 


Lys 


Leu 


225 








230 










235 










240 


Phe 


His 


lie 


Asp 


Leu 


Asn 


Asp 


Gin 


Arg 


Met 


Ser 


Arg 


Phe 


Asp 


Gin 


Asp 








245 










250 










255 




Leu 


Arg 


Phe 


Gly 


Ser 


Glu 


Asn 


Leu 


Lys 


Ala 


Gly 


Phe 


Phe 


Leu 


Val 


Asp 






260 










265 










270 
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Leu 


Leu 


Glu 

275 


Ser 


Ser 


Gly 


Tyr 


Gin 
280 


Gly 


Pro 


Arg 


His 


Phe 
285 


Glu 


Ala 


His 


Ala 


Leu 


Arg 


Thr 


Glu 


Asp 


Glu 


Glu 


Gly 


Val 


Trp 


Thr 


Phe 


Val 


Arg 


Val 




290 








295 










300 










Cys 


Met 


Arg 


Thr 


Tyr 


Leu 


He 


He 


Lys 


Val 


Arg 


Ala 


Glu 


Thr 


Phe 


Arg 


305 






310 










315 










320 


Glu 


Asp 


Pro 


Glu 


Val 


Lys 


Glu 


Leu 


Leu 


Ala 


Ala 


Tyr 


Tyr 


Gin 


Glu 


Asp 








325 








330 










335 




Pro 


Ala 


Thr 


Leu 
340 


Ala 


Leu 


Leu 


Asp 


Pro 
345 


Tyr 


Ser 


Arg 


Glu 


Lys 
350 


Ala 


Glu 


Ala 


Leu 


Lys 


Arg 


Ala 


Glu 


Leu 


Pro 


Leu 


Glu 


Thr 


Lys 


Arg 


Arg 


Arg 


Gly 






355 








360 










365 








Tyr 


Ala 


Leu 


Glu 


Arg 


Leu 


Asp 


Gin 


Leu 


Ala 


Val 


Glu 


Tyr 


Leu 


Leu 


Gly 


370 










375 










380 










Val 


Arg 


Gly 




























335 

































60 
120 



<210> 58 
<211> 978 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 58 

atggggaaga acggcaacct gtgctgcttc tctctgctgc tgcttcttct cgccgggttg 

gcgtccggcc atcaaatcta cttcgtggag aagtaccaca cctccgagga caagtccacc 

tccaacacct cctccacccc gccgcagacc accctctcca ccaccaaggt gctcaagatc 180 

cgctacccgg acgacggtga gtggcccggc gccccgatcg acaaggacgg cgacggcaac 240 

ccggagttct acatcgagat caacctctgg aacatcctca acgccaccgg cttcgccgag 300 

atgacctaca acctcactag tggcgtgctc cactacgtgc agcagctcga caacatcgtg 360 

ctccgcgacc gctccaactg ggtgcacggc tacccggaaa tcttctacgg caacaagccg 420 

tggaacgcca actacgccac cgacggcccg atcccgctcc cgtccaaggt gtccaacctc 480 

accgacttct acctcaccat ctcctacaag ctcgagccga agaacggtct cccgatcaac 540 

ttcgccatcg agtcctggct cacccgcgag gcctggcgca ccaccggcat caactccgac 600 

gagcaggagg tgatgatctg gatctactac gacggcctcc agcccgcggg ctccaaggtg 660 

aaggagatcg tggtgccgat catcgtgaac ggcaccccgg tgaacgccac cttcgaggtg 720 

tggaaggcca acatcggctg ggagtacgtg gccttccgca tcaagacccc gatcaaggag 780 

ggcaccgtga ccatcccgta cggcgccttc atctccgtgg ccgccaacat ctcctccctc 

ccgaactaca ccgagaagta cctcgaggac gtggagatcg gcaccgagtt cggcaccccg 

tccaccacct ccgcccacct cgagtggtgg atcaccaaca tcaccctcac cccgct.cgac 960 
cgcccgctca tctcctag 978 

<210> 59 
<211> 1920 
<212> DNA 

<213> Aspergillus niger 



840 
900 



60 



<400> 59 

atgtccttcc gctccctcct cgccctctcc ggcctcgtgt gcaccggcct cgccaacgtg 

atctccaagc gcgccaccct cgactcctgg ctctccaacg aggccaccgt ggcccgcacc 120 

gccatcctca acaacatcgg cgccgacggc gcctgggtgt ccggcgccga ctccggcatc 180 

gtggtggcct ccccgtccac cgacaacccg gactacttct acacctggac ccgcgactcc 240 

ggcctcgtgc tcaagaccct cgtggacctc ttccgcaacg gcgacacctc cctcctctcc 300 

accatcgaga actacatctc cgcccaggcc atcgtgcagg gcatctccaa cccgtccggc 360 

gacctctcct ccggcgccgg cctcggcgag ccgaagttca acgtggacga gaccgcctac 420 
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accggctcct ggggccgccc gcagcgcgac ggcccggccc tccgcgccac "cgcca tga tc 480 
ggcttcggcc agtggctcct cgacaacggc tacacctcca ccgccaccga catcgtgtgg 540 
ccgctcgtgc gcaacgacct ctcctacgtg gcccagtact ggaaccagac cggctacgac 600 
ctctgggagg aggtgaacgg ctcctccttc ttcaccatcg ccgtgcagca ccgcgccctc 660 
gtggagggct ccgccttcgc caccgccgtg ggctcctcct gctcctggtg cgactcccag 720 
gccccggaga tcctctgcta cctccagtcc ttctggaccg gctccttcat cctcgccaac 780 
ttcgactcct cccgctccgg caaggacgcc aacaccctcc tcggctccat ccacaccttc 840 
gacccggagg ccgcctgcga cgactccacc ttccagccgt gctccccgcg cgccctcgcc 900 
aaccacaagg aggtggtgga ctccttccgc tccatctaca ccctcaacga cggcctctcc 960 
gactccgagg ccgtggccgt gggccgctac ccggaggaca cctactacaa cggcaacccg 1020 
tggttcctct gcaccctcgc cgccgccgag cagctctacg acgccctcta ccagtgggac 1080 
aagcagggct ccctcgaggt gaccgacgtg tccctcgact tcttcaaggc cctctactcc 1140 
gacgccgcca ccggcaccta ctcctcctcc tcctccacct actcctccat cgtggacgcc 1200 
gtgaagacct tcgccgacgg cttcgtgtcc atcgtggaga cccacgccgc ctccaacggc 1260 
tccatgtccg agcagtacga caagtccgac ggcgagcagc tctccgcccg cgacctcacc 1320 
tggtcctacg ccgccctcct caccgccaac aaccgccgca actccgtggt gccggcctcc 1380 
tggggcgaga cctccgcctc ctccgtgccg ggcacctgcg ccgccacctc cgccatcggc 1440 
acctactcct ccgtgaccgt gacctcctgg ccgtccatcg tggccaccgg cggcaccacc 1500 
accaccgcca ccccgaccgg ctccggctcc gtgacctcca cctccaagac caccgccacc 1560 
gcctccaaga cctccacctc cacctcctcc acctcctgca ccaccccgac cgccgtggcc 1620 
gtgaccttcg acctcaccgc caccaccacc tacggcgaga acatctacct cgtgggctcc 1680 
atctcccagc tcggcgactg ggagacctcc gacggcatcg ccctctccgc cgacaagtac 1740 
acctcctccg acccgctctg gtacgtgacc gtgaccctcc cggccggcga gtccttcgag 1800 
tacaagttca tccgcatcga gtccgacgac tccgtggagt gggagtccga cccgaaccgc 1860 
gagtacaccg tgccgcaggc ctgeggcacc tccaccgcca ccgtgaccga cacctggcgc 1920 



<210> 60 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> synthetic 
<400> 60 

Ser Glu Lys Asp Glu Leu 
1 5 



<210> 61 

<211> 561 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Xylanase BD7436 



<220> 

<221> CDS 

<222> (1) . . (561) 

<400> 61 

atg get age acc ttc tac tgg cat ttg tgg acc gac ggc ate ggc acc 48 

Met Ala Ser Thr Phe Tyr Trp His Leu Trp Thr Asp Gly He Gly Thr 
15 10 15 
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gtg aac get acc aac ggc age gac ggc aac tac age gtg age tgg age 96 

Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

aac tgc ggc aac ttc gtg gtg ggc aag ggc tgg acc acc ggc age get 144 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 

3 5 4 0 4 5 



ace agg gtg ate aac tac aac get cat get ttc age gtg gtg ggc aac 
Thr Arg Val lie Asn Tyr Asn Ala His Ala Phe Ser Val Val Gly Asn 
50 55 60 

get tac ttg get ttg tac ggc tgg acc agg aac age ttg ate gag tac 
Ala Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
65 70 75 80 

tac gtg gtg gac age tgg ggc acc tac agg cca acc ggc acc tac aag 
Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

ggc acc gtg acc age gac ggc ggc acc tac gac ate tac acc acc acc 
Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp lie Tyr Thr Thr Thr 

100 105 110 

agg acc aac get cca age ate gac ggc aac aac acc acc ttc acc caa 
Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

ttc tgg age gtg agg caa age aag agg cca ate ggc acc aac aac acc 
Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

ate acc ttc age aac cat gtg aac get tgg aag age aag ggc atg aac 
He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

ttg ggc age age tgg age tac caa gtg ttg get acc gag ggc tac caa 
Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 

age age ggc tac age aac gtg acc gtg tgg tag 
Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

185 





180 


<210> 


62 


<211> 


186 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic Construct 


<400> 


62 



Met Ala Ser Thr Phe Tyr Trp His Leu Trp Thr Asp Gly He Gly Thr 
15 10 15 



192 



240 



288 



336 



384 



432 



480 



528 



561 



Page 61 



WO 2005/096804 



PCT/US2004/007182 



Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

Thr Arg Val He Asn Tyr Asn Ala His Ala Phe Ser Val Val Gly Asn 
50 55 60 

Ala Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu He Glu Tyr 
65 70 75 80 

Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 

100 105 110 

Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 1 35 14*0 

lie Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 



Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



<210> 63 

<211> 561 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Xylanase BD6002A 



<220> 

<221> CDS 

<222> (1) . . (561) 
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<400> 63 

atg get age acc gac tac tgg caa aac tgg acc gac ggc ggc ggc acc 48 

Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
15 10 15 

gtg aac get acc aac ggc age gac ggc aac tac age gtg age tgg age 96 
Vai Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

aac tgc ggc aac ttc gtg gtg ggc aag ggc tgg acc acc ggc age get 144 
Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

acc agg gtg ate aac tac aac get ggc get ttc age cca age ggc aac 192 
Thr Arg Val lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 55 60 

ggc tac ttg get ttg tac ggc tgg acc agg aac age ttg ate gag tac 240 
Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
65 70 75 80 

tac gtg gtg gac age tgg ggc acc tac agg cca acc ggc acc tac aag 288 
Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

8 5 90 95 

ggc acc gtg acc age gac ggc ggc acc tac gac ate tac acc acc acc 336 
Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 

100 105 110 

aqg acc aac get cca age ate gac ggc aac aac acc acc ttc acc caa 384 
Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

ttc tgg age gtg agg caa age aag agg cca ate ggc acc aac aac acc 4 32 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

ate acc ttc age aac cat gtg aac get tgg aag age aag ggc atg aac 480 
He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

ttg ggc age age tgg age tac caa gtg ttg get acc gag ggc tac caa 528 
Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 

age age ggc tac age aac gtg acc gtg tgg tag 561 
Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

18 0 18 5 



<210> 64 

< 2 1 1 > 18 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
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<400> 64 



Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
15 10 15 

Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

Thr Arg Val He Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 55 60 

Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu He Glu Tyr 
65 70 75 80 

Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp He Tyr Thr Thr Thr 

100 105 ' 110 

Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

lie Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 



Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



<210> 65 

<211> 561 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Xylanase BD6002B 
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<220> 

<221> CDS 

<222> (1) . . (561) 

<400> 65 

atg gcc tec acc gac tac tgg cag aac tgg acc gac ggc ggc ggc acc 48 
Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 

15 10 15 

gtg aac gcc acc aac ggc tec gac ggc aac tac tec gtg tec tgg tec 96 
Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

aac tgc ggc aac ttc gtg gtg ggc aag ggc tgg acc acc ggc tec gcc 144 
Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

acc cgc gtg ate aac tac aac gcc ggc gcc ttc tec ccg tec ggc aac 192 
Thr Arg Val lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 55 60 

ggc tac etc gcc etc tac gcc tgg acc cgc aac tec etc ate gag tac 240 
Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu lie Glu Tyr 
65 70 75 80 



tac gtg gtg gac tec tgg ggc acc tac cgc ccg acc ggc acc tac aag 
Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 



288 



ggc acc gtg acc tec gac ggc ggc acc tac gac ate tac acc acc acc 336 

Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp lie Tyr Thr Thr Thr 

100 105 110 

cgc acc aac gcc ccg tec ate gac ggc aac aac acc acc ttc acc cag 384 

Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

ttc tgg tec gtg cgc cag tec aag cgc ccg ate ggc acc aac aac acc 432 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

ate acc ttc tec aac cac gtg aac gcc tgg aag tec aag ggc atg aac 480 

He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 

145 150 155 160 

etc ggc tec tec tgg tec tac cag gtg etc gcc acc gag ggc tac cag 528 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 

tec tec ggc tac tec aac gtg acc gtg tgg tga 561 
Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 18 5 



<210> 66 
<211> 186 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 66 

Met Ala Ser Thr Asp Tyr Trp Gin Asn Trp Thr Asp Gly Gly Gly Thr 
15 10 15 

Val Asn Ala Thr Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Ser 

20 25 30 

Asn Cys Gly Asn Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala 
35 40 45 

Thr Arg Val lie Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn 
50 55 60 

Gly Tyr Leu Ala Leu Tyr Gly Trp Thr Arg Asn Ser Leu He Glu Tyr 
65 70 75 80 

Tyr Val Val Asp Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys 

85 90 95 

Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp lie Tyr Thr Thr Thr 

100 105 110 

Arg Thr Asn Ala Pro Ser He Asp Gly Asn Asn Thr Thr Phe Thr Gin 
115 120 125 

Phe Trp Ser Val Arg Gin Ser Lys Arg Pro He Gly Thr Asn Asn Thr 
130 135 140 

He Thr Phe Ser Asn His Val Asn Ala Trp Lys Ser Lys Gly Met Asn 
145 150 155 160 

Leu Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Gin 

165 170 175 



Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

180 185 



<210> 67 
<211> 2071 
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<212> DNA 

<213> Oryza sativa 

<220> 

<221> mi sc_f eature 

<222> ( 1 ) . . (2071 ) 

<223> Promoter 

<400> 67 



tcca tgctgt 


cct ac tact t 


get tcatccc 


ct t ct acat t 


1 1 gt tctggt 


ttt t ggee t g 


60 


cat t tcgga t 


cat gatgtat 


gt gat t tcca 


atetgetgea 


atatgaa tgg 


agact ct gtg 


12 0 


ct aacca tea 


acaaca t gaa 


atgettat ga 


ggect t tget 


gagcagecaa 


t ct tgect gt 


180 


gtttatgtct 


t cacaggccg 


aat t cctctg 


ttttgttttt 


caccc tcaat 


a 1 1 tggaaac 


2 4 0 


atttatctag 


gt t gt 1 1 gtg 


t ccaggccta 


taaa tcatac 


atgatgttgt 


cgt a 1 1 gga t 


300 


gtgaa tgtgg 


tggcgt gt t c 


agt gect t gg 


a 1 1 tgagtt t 


ga t gagag 1 1 


get t ct ggg t 


*5 n 


caccac t ca c 


cat t a t cga t 


gctcct cttc 


agcat aagg t 


aaaagt ct t c 


cctgt t tacg 


/ion 

4 Z. U 


ttat tttacc 


cacta tggtt 


get tgggt tg 


gt tttttcct 


ga 1 1 get tat 


gecat ggaaa 


4 o 0 


gtcatttgat 


atgt tgaact 


tgaa 1 1 aact 


gt agaa 1 1 gt 


at acatgt t c 


catt tgtgt t 


D 4 0 


gtact t cctt 


cttttctatt 


ag t agcet ca 


ga t gag tgt g 


aaaaaaacag 


a ttatataac 


oUU 


t tgeccta ta 


aatcat t tga 


aaaaaatatt 


g t acag tgag 


aaattgatat 


atagtgaatt. 


^ n 

DuU 


1 1 1 a agagca 


tgt 1 1 1 cct a 


aagaagt at a 


tattttctat 


gtacaaaggc 


ca l igadgLd 


i ? n 


1 L L - _ -1 — 

attgtagata 


ca gga t a a t g 


tagacttttt 


gga c L t aca c 


tgetaect l t 


dayLddCad L 


7 ft Pi 
/ o U 


catqagcaat 


agtg 1 1 gcaa 


tgatatttag 


get gca t teg 


l t tact ct ct 


tgacttccat 




gagcaege 1 1 


cccaaa ct gt 


taaact ct gt 


^■4- 4—4-4—4-4— ^ » 

gttttttgee 


aa aaaaaaa t 


gcataggaaa 




g 1 t get 1 1 ta 


aaaaat ca t a 


tcaa t ccatt 


ttttaagt ta 


tagctaa tac 


ttaat taat c 


960 


a t gcgctaat 


aagt cactct 


gtt tt tegta 


ctagagagat 


tgttt tgaac 


cagcact caa 


1020 


gaacacagcc 


t taacccagc 


caaataa tgc 


t acaacctac 


cagt ccacac 


ct ct tgtaaa 


1080 


gca 1 1 1 gt tg 


ca tggaaaag 


ctaagatgac 


agcaacct g t 


t caggaaaac 


aactgacaag 


1140 


gt ca taggga 


gagggagct t 


1 1 ggaaaggt 


geegt gcagt 


tcaaacaat t 


agt tagcagt 


1200 


aggg t gt t gg 


ttt ttgetea 


cagcaa taag 


aagt taatca 


tggtgtaggc 


aaccca aa t a 


1260 


aaacaccaaa 


ata tgcacaa 


ggcagt t tgt 


tgtat tctgt 


agtacagaca 


aaactaaaag 


1320 


taa tgaaaga 


aga tgtggtg 


t tagaaaagg 


aaacaa ta tc 


atgagtaa tg 


tgtgggca 1 1 


1380 


a t gggaccac 


gaaa t aaaaa 


gaaeat tt tg 


atgagtcgt g 


ta tcctcgat 


gagectcaaa 


1440 
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agttctctca 


ccccggataa 


gaaaccct t a 


agcaatgt gc 


aaagtt tgea 


ttct ccactg 


1500 


acat aatgca 


aaa t aaga t a 


tea tcga tga 


ca tagcaact 


catgea t ca t 


atcat gect c 


1560 


t ctcaacct a 


t tea 1 1 ccta 


ctcat ctaca 


taagtatct t 


cagctaaatg 


1 1 agaacata 


1620 


aaccca taag 


teaegt 1 1 ga 


tgagt attag 


gcgt gacaca 


tgacaaa tea 


cagactcaag 


1680 


caaga taaag 


caaaa tga tg 


tgtacataaa 


act ccagaqc 


tatatgt cat 


a t tgcaaaaa 


•1740 


gaggagagct 


t a taagacaa 


ggca tgactc 


acaaaaat tc 


att tgccttt 


eg tgt caaaa 


1800 


agaggagggc 


tt tacat ta t 


cca t gt ca t a 


t tgcaaaaga 


aagagagaaa 


gaacaacaca 


1860 


a tgctgcgtc 


aattatacat 


a t ctgtatgt 


ccatcat tat 


tea tccacct 


t tcgtgtacc 


1920 


acacttcata 


ta tea tgagt 


cact tcatgt 


ctggacat ta 


acaaactct a 


tcttaacat t 


1980 


taga tgcaag 


agect tt ate 


tcactat aaa 


t gcacgatga 


t ttctcattg 


t ttctcacaa 


2040 


aaagcat tea 


gt tcatt agt 


cc tacaacaa 


c 






2071 



<210> 68 

<211> 79 

<212> PRT 

<213> Zea mays 



<220> 

<221> SIGNAL 
<222> ( 1) . . (79) 

<223> Maize waxy signal sequence. 
<400> 68 

Met Leu Ala Ala Leu Ala Thr Ser Gin Leu Val Ala Thr Arg Ala Gly 
15 10 15 



Leu Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gin Gly 

20 ' 25 30 



Leu Arg Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg 
35 40 45 



Thr Ser Ala Arg Ala Ala Pro Arg His Gin His Gin Gin Ala Arg Arg 

5 0 5 5 60 



Gly Ala Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Ala 
65 " 70 75 



<210> 69 
<211> 1005 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Bromelain Sequence 



<220> 

<22i> CDS 

<222> (1) . . (1005) 

<223> Synthetic Bromelain 

<400> 69 

atg gcc tgg aag gtg cag gtg gtg ttc etc ttc etc ttc etc tgc gtg 

Met Ala Trp Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 
15 10 15 

atg tgg gcc tec ccg tec gcc gcc tec gcg gac gag ccg tec gac ccg 
Met Trp Ala Ser Pro Ser Ala Ala Ser Ala Asp Glu Pro Ser Asp Pro 

20 25 30 

atg atg aag cgc ttc gag gag tgg atg gtg gag tac ggc cgc gtg tac 
Met Met Lys Arg Phe Glu Glu Trp Met Val Glu Tyr Gly Arg Val Tyr 
35 40 45 



etc ggc ate aac cag ttc ace gac atg acc aac aac gag ttc ate gcc 
Leu Gly lie Asn Gin Phe Thr Asp Met Thr Asn Asn Glu Phe lie Ala 

85 90 95 

cag tac acc ggc ggc ate tec cgc ccg etc aac ate gag cgc gag ccg 
Gin Tyr Thr Gly Gly lie Ser Arg Pro Leu Asn lie Glu Arg Glu Pro 

100 " 105 110 



48 



96 



144 



aag gac aac gac gag aag atg cgc cgc ttc cag ate ttc aag aac aac 192 

Lys Asp Asn Asp Glu Lys Met Arg Arg Phe Gin lie Phe Lys Asn Asn 

50 55 60 

gtg aac cac ate gag acc ttc aac tec cgc aac gag aac tec tac acc 240 

Val Asn His lie Glu Thr Phe Asn Ser Arg Asn Glu Asn Ser Tyr Thr 

65 70 75 80 



288 



336 



gtg gtg tec ttc gac gac gtg gac ate tec gcc gtg ccg cag tec ate 384 
Val Val Ser Phe Asp Asp Val Asp lie Ser Ala Val Pro Gin Ser lie 
115 120 125 

gac tgg cgc gac tac ggc gcc gtg acc tec gtg aag aac cag aac ccg 432 
Asp Trp Arg Asp Tyr Gly Ala Val Thr Ser Val Lys Asn Gin Asn Pro 
130 135 140 

tgc ggc gcc tgc tgg gcc ttc gcc gcc ate gcc acc gtg gag tec ate 480 
Cys Gly Ala Cys Trp Ala Phe Ala Ala lie Ala Thr Val Glu Ser lie 
145 150 155 160 

tac aag ate aag aag ggc ate etc gag ccg etc tec gag cag cag gtg 528 
Tyr Lys He Lys Lys Gly He Leu Glu Pro Leu Ser Glu Gin Gin Val 

165 170 175 

etc gac tgc gcc aag ggc tac ggc tgc aag ggc ggc tgg gag ttc cgc 576 
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Leu Asp Cys Ala Lys Gly Tyr Gly Cys Lys Gly Gly Trp Glu Phe Arg 

180 185 190 

gcc ttc gag ttc ate ate tec aac aag ggc gtg gee tec ggc gee ate 
Ala Phe Glu Phe He He Ser Asn Lys Gly Val Ala Ser Gly Ala He 
195 200 205 

tac ccg tac aag gcc gcc aag ggc acc tgc aag acc gac ggc gtg ccg 
Tvr Pro Tyr Lys Ala Ala Lys Gly Thr Cys Lys Thr Asp Gly Val Pro 
210 215 220 

aac tec gcc tac ate acc ggc tac gcc cgc gtg ccg cgc aac aac gag 
Asn Ser Ala Tyr lie Thr Gly Tyr Ala Arg Val Pro Arg Asn Asn Glu 
225 230 235 240 

tec tec atg atg tac gcc gtg tec aag cag ccg ate acc gtg gcc gtg 
Ser Ser Met Met Tyr Ala Val Ser Lys Gin Pro lie Thr Val Ala Val 

" ' " 24 5 250 255 

gac gcc aac gcc aac ttc cag tac tac aag tec ggc gtg ttc aac ggc 
Asp Ala Asn Ala Asn Phe Gin Tyr Tyr Lys Ser Gly Val Phe Asn Gly 

260 265 270 

ccq tac gac acc tec etc aac cac gcc gtg acc gcc ate ggc tac ggc 
Pro Cvs Gly Thr Ser Leu Asn His Ala Val Thr Ala lie Gly Tyr Gly 
275 280 285 

caq gac tec ate ate tac ccg aag aag tgg ggc gcc aag tgg ggc gag 
Gin Asp Ser lie lie Tyr Pro Lys Lys Trp Gly Ala Lys Trp Gly Glu 
290 295 300 

gcc ggc tac ate cgc atg gcc cgc gac gtg tec tec tec tec ggc ate 
Ala Gly Tyr lie Arg Met Ala Arg Asp Val Ser Ser Ser Ser Gly He 
305 310 315 320 

tgc ggc ate gcc ate gac ccg etc tac ccg acc etc gag gag tag 
Cys Gly He Ala He Asp Pro Leu Tyr Pro Thr Leu Glu Glu 

325 330 



<210> 


70 


<2 1 1 > 


334 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthet ic Construct 


<400> 


70 



Met Ala Trp Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 

x 5 10 15 

Met Trp Ala Ser Pro Ser Ala Ala Ser Ala Asp Glu Pro Ser Asp Pro 

20 25 30 



624 



672 



720 



768 



816 



864 



912 



960 



1005 



Page 70 



WO 2005/096804 



PCT/US2004/007182 



Met Met Lys Arg Phe Glu Glu Trp Met Val Glu Tyr Gly Arg Val Tyr 
35 40 45 

Lys Asp Asn Asp Glu Lys Met Arg Arg Phe Gin He Phe Lys Asn Asn 
50 55 60 

Val Asn His He Glu Thr Phe Asn Ser Arg Asn Glu Asn Ser Tyr Thr 
65 70 75 80 

Leu Gly He Asn Gin Phe Thr Asp Met Thr Asn Asn Glu Phe He Ala 

85 90 95 

Gin Tyr Thr Gly Gly He Ser Arg Pro Leu Asn lie Glu Arg Glu Pro 

100 105 110 

Val Val Ser Phe Asp Asp Val Asp He Ser Ala Val Pro Gin Ser lie 
115 120 125 

Asp Trp Arg Asp Tyr Gly Ala Val Thr Ser Val Lys Asn Gin Asn Pro 
130 135 140 

Cvs Gly Ala Cys Trp Ala Phe Ala Ala He Ala Thr Val Glu Ser He 
145 150 155 160 

Tyr Lys He Lys Lys Gly He Leu Glu Pro Leu Ser Glu Gin Gin Val 

165 170 175 

Leu Asp Cys Ala Lys Gly Tyr Gly Cys Lys Gly Gly Trp Glu Phe Arg 

180 185 190 

Ala Phe Glu Phe lie lie Ser Asn Lys Gly Val Ala Ser Gly Ala lie 
195 200 205 

Tyr Pro Tyr Lys Ala Ala Lys Gly Thr Cys Lys Thr Asp Gly Val Pro 
210 215 220 

Asn Ser Ala Tyr lie Thr Gly Tyr Ala Arg Val Pro Arg Asn Asn Glu 
225 230 235 240 

Ser Ser Met Met Tyr Ala Val Ser Lys Gin Pro lie Thr Val Ala Val 

245 250 255 

Asp Ala Asn Ala Asn Phe Gin Tyr Tyr Lys Ser Gly Val Phe Asn Gly 

260 265 270 
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Pro Cvs Gly Thr Ser Leu Asn His Ala Val Thr Ala lie Gly Tyr Gly 
275 280 285 

Gin Asp Ser lie He Tyr Pro Lys Lys Trp Gly Ala Lys Trp Gly Glu 
290 295 300 

Ala Gly Tyr lie Arg Met Ala Arg Asp Val Ser Ser Ser Ser Gly lie 
305 310 315 320 

Cys Gly lie Ala He Asp Pro Leu Tyr Pro Thr Leu Glu Glu 

32 5 330 



<210> 71 
<211> 78 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Bromealin signal sequence 
<400> 71 

atggcctgga aggtgcaggt ggtgttcctc ttcctcttcc tctgcgtgat gtgggcctcc 
ccgtccgccg cctccgcc 

<210> 72 

<211> 26 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Bromealin signal peptide 

<400> 72 

Met Ala Trp Lys Val Gin Val Val Phe Leu Phe Leu Phe Leu Cys Val 
1 5 10 IS 

Met Trp Ala Ser Pro Ser Ala Ala Ser Ala 

20 25 



<210> 73 

<211> 1050 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pSYNHOOO 



60 
78 
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< 4 0 0> 7 3 
a t ggcctgga 


aggtgeagg t 


ggtgt tcctc 


t tcctct tec 


tetgegtgat 


gt gggcctcc 


60 


ccg tccgccg 


cct ccgcgga 


cgagecg tec 


gacccgat ga 


tgaageget t 


cgaggagtgg 


120 


a t ggt ggagt 


acggccgcgt 


gt acaaggac 


aacgacgaga 


aga t gcgccg 


ct tccaga tc 


180 


1 1 caagaaca 


aegtgaacca 


ca t cgagacc 


t tcaactccc 


gcaacgagaa 


ct cct acacc 


240 


ctcggcat ca 


accagt t cac 


cgaca tgacc 


aacaacgagt 


tcatcgccca 


gtacaccggc 


• 300 


ggca t c t ccc 


gcccgct caa 


ca tegagege 


gagccggtgg 


tgtcct t cga 


cgacgtggac 


360 


a tccccgccg 


tgccgcagt c 


ca tcgactgg 


cgcgactacg 


gcgccg tgac 


ctccgtgaag 


420 


aaccagaacc 


eg tgeggege 


ct getgggee 


1 1 cgccgcca 


tcgccaccgt 


ggagt ccatc 


480 


tacaagat ca 


agaagggca t 


cc tegagecg 


ct ctccgagc 


ageaggtget 


cgactgcgcc 


540 


aagggct acg 


getgeaaggg 


egget gggag 


t tccgcgcct 


tcgagt teat 


catct ccaac 


600 


aagggcgtgg 


cct ccggcgc 


ca tctacccg 


tacaaggecg 


ccaagggcac 


c t gcaagacc 


660 


gacggcgtgc 


cgaactccgc 


ct aca tcacc 


ggctacgccc 


gcgtgccgcg 


caacaacgag 


720 


tcctccat ga 


tgtacgccgt 


gt ccaagcag 


ccgat caccg 


tggccgtgga 


cgccaacgcc 


780 


aact t ccagt 


act acaaqtc 


cggcgtgt t c 


aacggcccgt 


gcggcacct c 


cctcaaccac 


840 


gccg t gaccg 


ccatcggcta 


cggccaggac 


LCCaiCdLCt 




y '-yyyy^-y — ^ 


900 


aagtggggcg 


aggceggcta 


catccgcatg 


gcccgcgacg 


t gt cct cctc 


ctccggca tc 


960 


tgcggcat eg 


cca t cgaccc 


gctctacccg 


accctcgagg 


aggtgtt cgc 


egaggeca tc 


1020 


gccgccaact 


ccaccct cgt 


ggccgagt ag 


• 






1050 


<210> 74 
<211> 1067 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> pSYN11589 












<400> 74 
tggcctggaa 


ggtgcaggtg 


gt gt tcctct 


t cctct t cct 


ctgcgtgatg 


tgggcctccc 


60 


eg tccgccgc 


ctccgcctcc 


tcctcctcct 


t cgccgact c 


caacccga tc 


cgcccggtga 


120 


ccgaccgcgc 


cgect ccacc 


gaegagcegt 


ccgacccga t 


ga t gaagege 


t tcgaggagt 


180 


gga tggtgga 


g tacggccgc 


gtgtacaagg 


acaacgacga 


gaagatgege 


cget tccaga 


240 


t ct tcaagaa 


caa eg tgaac 


caca tcgaga 


cct tcaact c 


ccgcaacgag 


aactcctaca 


300 


ccctcggcat 


caaccagt t c 


accgaca tga 


ccaacaacga 


gttcatcgcc 


cag tacaccg 


360 



Page 73 



WO 2005/096804 



PCT/US2004/007182 



gcggca tctc 


ccgcccgct c 


aacatcgagc 




nrtt" n t" rrt" t~ c 


oaoaacatLGO 

\mmj t-4 V< N-j U NhJ ^-H 


4 20 


aca t ct ccgc 


cgtgccgcag 


tcca t cga c t 


ggegegacta 


f*i /*t * n t~ pi 


o t V— ^ ^ V- ^ CJ 


4 80 


agaaccaqaa 


cccgtgcggc 


gcct gc tggg 


cct t cgccgc 


ca ccycLdou 


nt nn a n f pra 


5 4 0 


t ctacaaga t 


caagaagggc 


a t cct cgagc 


cgc t ct ccga 


gcagcaggtg 


l. LLijdt Lyuy 


600 
\j \j \j 


ccaagggcta 


cggctgcaag 


ggcggctggg 


agt t ccgcgc 


cttcgagttc 


at" pat'pt'r'pa 


6 60 
\j \j \j 


acaagggcg t 


ggcct ccggc 


1 ■ ____ J m ^ J»» Jl !■ 

gcca tctacc 


eg tacaagg c 


cyccddygyL 




7 20 


ccgacggcgt 


gccgaact cc 


gcct aca tea 


cegget a cgc 


ccgcgtgccg 




7 80 


aqtcctccat 


qatgtacgcc 


gtgt ccaagc 


agccgatcac 


cgtggccgtg 


gacgccaacg 


840 


ccaacz tcca 


gt act acaag 


t ccggcgtgt 


t caacggccc 


gt gcggcacc 


tccctcaacc 


900 


acgccg tgac 


cgcca t cggc 


taeggecagg 


actcca tea t 


ct acccgaag 


aagtggggcg 


960 


ccaagtgggg 


cgaggccggc 


tacatccgca 




pnt at rrt nr. 


tcctccaaca 


1020 


tct gcggcat 


cgcca tcgac 


ccgctc tacc 


cgaccctcga 


ggagtag 




1067 


<210> 75 
<211> 1023 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> pSYN11587 Sequence 










<400> 75 
atggcct gga 


aggtgcaggt 


ggtgtt cctc 






nt nnacctcc 


60 


ccg tccgccg 


cctccgcgga 


cgagccgtcc 




t'CiaFlCMTJCt t" 


raaaaaatoo 


120 


atggtggagt 


acggccgcgt 


gt acaaggac 


aacgacgaga 


a y a Lycy t ty 




180 


1 1 caagaaca 


acg t gaa cca 


cat cgagacc 


L tCaaCtCCC 


yuadLyay aa 


r* 1" r*r"t ararr 


240 


c tcggca t ca 


accag 1 1 cac 


cgaca tgacc 


aacaacgagi 


Ltd LC-y^fwd 




300 


ggca t ct ccc 


gcccgct caa 


ca tegagege 


gagcegg ig g 




caacataaac 

V_* \*4 l—A \u» U >^4 W 


360 


a t ct ccgccg 


t gccgcagtc 


catcgactgg 




'sj o n • \^ u l u a 


ct ccataaaa 


420 


aaccagaacc 


cgtgcggcgc 


ctgctgggcc 


t t cgccgcca 


tcgccaccgt 


ggagt cca tc 


480 


t acaaga t ca 


agaagggca t 


cct egagecg 


ClLLLLLJayL 




caactacacc 


540 


aagggctacg 


gctgcaaggg 


eggctgggag 


t tccgcgcct 


tcgagttca t 


catctccaac 


600 


aagggcgt gg 


cctccggcgc 


ca tctacccg 


tacaaggecg 


ccaagggcac 


ct gcaagacc 


660 


gacggcg tgc 


cgaactccgc 


ctaca tcacc 


ggctacgccc 
Page 74 


gcgt gecgeg 


caacaacgag 


720 
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t cctcca tga 


tgt aegcegt 


gt ccaagcag 


ccga tcaccg 


tggccgtgga 


cgecaacgcc 


Ton 
/ o U 


aactt ccag t 


actacaagt c 


eggegtg t tc 


aacggcccg t 


gcggcacct c 


cct caaccac 


840 


gccgt gaccg 


ccatcggct a 


cggccaggac 


tccat catct 


a cccgaagaa 


gtggggcgcc 


900 


aagtggggcg 


aggceggcta 


ca t ccgea tg 


gcccgcgacg 


tgtcctcct c 


ctccggca t c 


960 


t gcggca teg 


cca t cqaccc 


get ctacccg 


accct cgagg 


dy tCLgdyad 


yyauyay^/Ly 


1020 


tag 












1023 


<210> 76 
<211> 990 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> pSYN12169 Sequence 










< 4 0 0 > / o 
a t gagggt gt 


tgctcgt tgc 


cctcgct etc 


ct ggct ct eg 


ct gegagege 


LdLL L Ltd Ly 


60 


gcgga cgagc 


cgtccgaccc 


ga tgatgaag 


cget tcgagg 


agi-ggauyg l 


n o ^ n 1~ pi n rr 
y a y Lauyy ^ 


120 


cgcgt gt aca 


aggacaacga 


cgagaaga tg 


cgccgct tec 


a /^t n t" ^ h f- 2 3 
dydUCL Ltdd 


u a a ^a a ^-y 


180 

-L w" w 


aaccaca teg 


agacct t caa 


ctcccgcaac 


gagaactccL 


ar'ar^^^t" r" n O 

aLduttLLyij 


c p 1 - c a a c c a a 


24 0 


1 1 caccgaca 


t gaccaacaa 


cgagtt catc 


gcccagtaca 


ccgyoy y La l 




300 


ctcaaca t eg 


agcgcgagcc 


ggtggtgt cc 


tiLcgacgacg 


LUUdLd L L 


rncc'Cit acca 

^\J\w\_»y \_ \^ 


360 


cagtcca teg 


actggegega 


ct acggcgcc 


gLgaccLceg 


Ly day aaLL.a 




420 


ggegect get 


gggect tege 


cgccatcgcc 


accguggag l 


LLa LL LaUda 


u u L w a u m u u y 


4 80 


ggcat cctcg 


agccgctctc 


cgagcagcag 


gLgcucgaCL 


gcgcLddyyy 


ct^cncfCtcic 


540 


aagggegget 


gggagt t ccg 


cgccttcgag 


LLCaLCdLtL 


LLaauaayyy 


cataacctcc 


600 


ggegeca t ct 


acccgt acaa 


ggccgccaag 


ggcacu t gca 


dgaCLgdtyy 




660 


t ccgcct aca 


t caccggcta 


cgcccgcgtg 


< — i t~~* /— i 3 ^\ 

CCyCgtadtd 




catoatatac 


720 


gecg tgtcca 


ageagcega t 


caccgtggcc 


gtggacgcca 


acgccaactt 


ceagt actac 


~j o r\ 

7 8 0 


aagtccggcg 


t gt teaaegg 


cccgtgcggc 


acct ccct ca 


accacgccgt 


gaccgcca t c 


840 


ggctacggcc 


aggactccat 


catctacccg 


aagaagtggg 


gcgccaagt g 


gggegaggee 


900 


ggctaca tec 


gca tggcccg 


cgacg tgtcc 


t cctcctccg 


geatctgegg 


ca tcgccatc 


960 


gacccgc t ct 


acccgaccct 


cgaggagtag 








990 
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<210> 77 

<211> 1170 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pSYN12575 Sequence 



s A 0 0 *> 7 7 

atgctggcgg 


ctct ggecac 


gt cgcagctc 


gtcgcaacgc 


gcgccggcc t 


gggcgt cccg 


■ 60 


gacgcgt cca 


cgt tccgccg 


cggcgccgcg 


cagggectga 


ggggggcccg 


ggcgtcggcg 


120 


gcggcggaca 


cgctcagcat 


gcggaccagc 


gcgcgcgcgg 


cgcccaggca 


ccagcaccag 


180 


caggcgcgcc 


gcggggccag 


gt tcccg teg 


ctegtegt gt 


gcgccagcgc 


cggcgcca t g 


240 


gcggacgagc 


cgt ccgaccc 


ga tgatgaag 


cgcttcgagg 


agtgga tggt 


ggagt aegge 


300 


cgcgtgtaca 


aggacaacga 


cgagaaga tg 


cgccgct t cc 


aga tct tcaa 


gaacaacgtg 


360 


aaccaca teg 


agacct tcaa 


ct cccqcaac 


gagaact cct 


acaccct egg 


catcaaccag 


420 


1 1 caccgaca 


t gaccaacaa 


cgagt tea t c 


geccag taca 


ccggcggca t 


ctcccgcccg 


480 


ct caaca teg 


agcgcgagcc 


ggtggtgtcc 


ttcgacgacg 


tggacatct c 


cgccgtgccg 


540 


cagtccat eg 


actggegega 


ct acggcgcc 


gtgacct ccg 


t gaagaacca 


gaacccgtgc 


600 


ggegect get 


gggee 1 1 cgc 


cgeca tegee 


accgtggagt 


ccatct acaa 


gat caagaag 


660 


ggca tcctcg 


agccgctct e 


cgagcagcag 


gtgetcgact 


gcgccaaggg 


etaegget gc 


720 


aagggegget 


gggag 1 1 ccg 


/~r r> i~ i- r*> n ^ n 

Cycct Lcydy 




ccaacaaaaa 


ca t qqcc t cc 


780 


ggegeca tct 


acccgt acaa 


ggccgccaag 


ggcacctgca 


agaccgaegg 


cgtgccgaac 


840 


tccgcct aca 


tcaccggct a 


cgcccgcgtg 


ccgcgcaaca 


dCyag iccic 




900 


gecg tgt cca 


ageagecgat 


caccgtggcc 


gtggacgcca 


acgccaact t 


ccagtactac 


960 


aagt ccggcg 


tgt t caaegg 


cccgt gegge 


acctccct ca 


accacgccgt 


gaccgccatc 


1020 


ggctacggcc 


aggact cca t 


ca tctacccg 


aagaag tggg 


gcgccaagtg 


gggegaggee 


1080 


ggctaca t cc 


gca t ggcccg 


cgacgtgt cc 


tcct cctccg 


gca t ct gegg 


cat cgccatc 


1140 


gacccgctct 


acccgaccct 


cgaggagt ag 








1170 


<210> 78 
<211> 1068 
<212> DNA 















<213> Artificial Sequence 

<220> 

<223> pSM270 Sequence 
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<400> 78 
a t ggcctgga 


agg tgcaggt 


ggtgt tcctc 


t tcctct tec 


tctgcgtga t 


g t gggcctcc 


60 


ccg tccgccg 


cctccgcct c 


ct cct cct cc 


t tcgccgact 


ccaacccqa t 


ccgcccggtg 


120 


accgaccgcg 


ccgcct ccac 


cgacgagccg 


t ccgacccga 


t gatgaagcg 


ct tcgaggag 


180 


tggatggtgg 


ag tacggccg 


cgtgtacaag 


gacaacgacg 


agaagatgeg 


ccget t ccag 


240 


a t ct t caaga 


acaacgtgaa 


ccaca t cgag 


acct tcaact 


cccgcaacga 


gaact cctac 


300 


accct cggca 


t caaccag 1 1 


caccgacatg 


accaacaacg 


agt t ca tege 


ccagt acacc 


360 


ggcggcat ct 


cccgcccgct 


caacat cgag 


cgcgagccgg 


tggt gtcctt 


cgacgacg tg 


420 


gaca t ct ccg 


ccg t gccgca 


gt ccatcgac 


t ggegegact 


acggcgccgt 


gacct ccgtg 


480 


aagaaccaga 


acccgtgcgg 


cgcctgctgg 


gect t cgccg 


cca t cgccac 


cgtggagtcc 


540 


a tctacaaga 


t caagaaggg 


cat cct cgag 


ccgctct ccg 


agcagcaggt 


gctcgactgc 


600 


gccaagggct 


acggct gcaa 


gggcggctgg 


gagt tecgeg 


cct tcgagtt 


ca teat ctcc 


660 


aacaagggcg 


tggcct ccgg 


cgeca t ct ac 


cegtacaagg 


ccgccaaggg 


cacct gcaag 


720 


accgacggcg 


tgccgaact c 


cgcctaca tc 


acegge t acg 


cccgcgt gec 


gcgcaacaac 


780 


aBat.cct.cc3 


tqat Q t acqc 


cgtgt ccaag 


cagcega tea 


ccgtggccgt 


ggacgccaac 


840 


gccaac 1 1 cc 


agt act acaa 


gtccggcgtg 


1 1 caacggcc 


cgtgcggcac 


ctccctcaac 


900 


cacgccgt ga 


ccgcca t egg 


ct aeggecag 


gact cca t ca 


tctacccgaa 


gaagtggggc 


960 


gccaag tggg 


gegaggcegg 


ct acatccgc 


a tggcccgcg 


acgtgtcct c 


ctcct cegge 


1020 


a t ctgcggca 


t cgeca tcga 


cccgct ct ac 


ccgaccct eg 


aggagt ag 




1068 



<210> 79 

<211> 1497 

<212> DNA 

<213> Trichoderma reesei 



<220> 

<221> CDS 

<222> (1) . . (14 97) 

<223> Trichoderma reesei cellobiohyrodlase I 
<400> 79 

atg cag teg gcg tgt act etc caa teg gag act cac ccg cct ctg aca 
Met Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
15 10 15 

tgg cag aaa tgc teg tct ggt ggc acg tgc act caa cag aca ggc tec 
Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 

20 25 30 



48 



96 
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gtg gtc ate gac gec aac egg cgc tgg act cac get acg aac age age 

Val Val He Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 

35 40 45 

acg aac tgc tac gat ggc aac act tgg age teg acc eta tgt cct gac 

Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 

50 ' 55 60 

aac gag acc tgc gcg aag aac tgc tgt ctg gac ggt gee gec tac gcg 

Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 

65 70 75 80 

tec acg tac gga gtt acc acg age ggt aac age etc tec att ggc ttt 

Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser He Gly Phe 

85 90 95 

gtc acc cag tct gcg cag aag aac gtt ggc get cgc ctt tac ctt atg 

Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 

100 105 HO 

gcg age gac acg acc tac cag gaa ttc acc ctg ctt ggc aac gag ttc 

Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 

115 120 125 

tct ttc gat gtt gat gtt teg cag ctg ccg tgc ggc ttg aac gga get 

Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 

130 135 140 

etc tac ttc gtg tec atg gac gcg gat ggt ggc gtg age aag tat ccc 

Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 

145 150 155 160 

acc aac acc get ggc gec aag tac ggc acg ggg tac tgt gac age cag 

Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 

165 170 175 

tgt ccc cgc gat ctg aag ttc ate aat ggc cag gee aac gtt gag ggc 

Cvs Pro Arq Asp Leu Lys Phe He Asn Gly Gin Ala Asn Val Glu Gly 

Y 180 185 190 

tgg gag ccg tea tec aac aac gcg aac acg ggc att gga gga cac gga 

Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly He Gly Gly His Gly 

195 200 205 

age tgc tgc tct gag atg gat ate tgg gag gec aac tec ate tec gag 

Ser Cys Cys Ser Glu Met Asp lie Trp Glu Ala Asn Ser He Ser Glu 

210 215 220 

get ctt acc ccc cac cct tgc acg act gtc ggc cag gag ate tgc gag 

Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu He Cys Glu 

225 230 235 240 

ggt gat ggg tgc ggc gga act tac tec gat aac aga tat ggc ggc act 

Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 

245 250 255 

tgc gat ccc gat ggc tgc gac tgg aac cca tac cgc ctg ggc aac acc 

Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr 



144 



192 



240 



288 



336 



384 



4 32 



480 



c 



28 



576 



624 



672 



720 



768 



816 
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age ttc tac 
Ser Phe Tyr 
275 

ttg acc gt t 
Leu Thr Val 

290 

tat gtc cag 
Tyr Val Gin 
305 

agt tac tct 
Ser Tyr Ser 



gca gaa ttc 
Ala Glu Phe 



ttc aag aag 
Phe Lys Lys 
355 

gat gat tac 
Asp Asp Tyr 
370 

aac gag acc 
Asn Glu Thr 
385 

age tec ggt 
Ser Ser Gly 



gtc acc ttc 
Val Thr Phe 



cct age ggc 
Pro Ser Gly 
435 

acc cgc cgc 
Thr Arg Arg 
450 

tct cac tac 
Ser His Tyr 
465 

tgc gee age 
Cys Ala Ser 



260 

ggc cct ggc 
Gly Pro Gly 



gtc acc cag 
Val Thr Gin 



aat ggc gtc 
Asn Gly Val 
310 

ggc aac gag 
Gly Asn Glu 
325 

ggc gga tec 
Gly Gly Ser 
340 

get acc tct 
Ala Thr Ser 



tac gee aac 
Tyr Ala Asn 



tec tec aca 
Ser Ser Thr 
390 

gtc cct get 
Val Pro Ala 
405 

tec aac ate 

Ser Asn lie 
420 

ggc aac cct 

Gly Asn Pro 



cca gee act 
Pro Ala Thr 



ggc cag tgc 
Gly Gin Cys 
470 

ggc aca act 
Gly Thr Thr 
4 8 5 



265 

tct age ttt 
Ser Ser Phe 
280 

ttc gag acg 

Phe Glu Thr 
295 

act ttc cag 

Thr Phe Gin 



etc aac gat 
Leu Asn Asp 



tct ttc tea 
Ser Phe Ser 
345 

ggc ggc atg 
Gly Gly Met 
360 

atg ctg tgg 
Met Leu Trp 
375 

ccc ggt gee 
Pro Gly Ala 



cag gtc gaa 
Gin Val Glu 



aag ttc gga 
Lys Phe Gly 
425 

ccc ggc gga 
Pro Gly Gly 
440 

acc act gga 
Thr Thr Gly 
455 

ggc ggt att 
Gly Gly lie 



tgc cag gtc 
Cys Gin Val 



acc etc gat 
Thr Leu Asp 



teg ggt gee 
Ser Gly Ala 
300 

cag ccc aac 
Gin Pro Asn 
315 

gat tac tgc 
Asp Tyr Cys 
330 

gac aag ggc 
Asp Lys Gly 



gtt ctg gtc 
Val Leu Val 



ctg gac tec 
Leu Asp Ser 
380 

gtg cgc gga 
Val Arg Gly 
395 

tct cag tct 
Ser Gin Ser 
410 

ccc att ggc 
Pro lie Gly 



aac ccg cct 
Asn Pro Pro 



age tct ccc 
Ser Ser Pro 
460 

ggc tac age 
Gly Tyr Ser 
475 

ctg aac cct 
Leu Asn Pro 
490 



270 

acc acc aag 

Thr Thr Lys 
285 

ate aac cga 

lie Asn Arg 



gee gag ctt 
Ala Glu Leu 



aca get gag 
Thr Ala Glu 
335 

ggc ctg act 
Gly Leu Thr 
350 

atg agt ctg 
Met Ser Leu 

365 

acc tac ccg 
Thr Tyr Pro 



age tgc tec 
Ser Cys Ser 



ccc aac gee 
Pro Asn Ala 
415 

age acc ggc 
Ser Thr Gly 
430 

ggc acc acc 
Gly Thr Thr 
445 

gga cct acc 
Gly Pro Thr 



ggc ccc acg 
Gly Pro Thr 



tac tac tct 
Tyr Tyr Ser 
4 95 



aaa 864 
Lys 

tac 912 
Tyr 



ggt 960 

Gly 

320 

gag 1008 
Glu 



cag 1056 
Gin 



tgg 1104 
Trp 



aca 1152 
Thr 



acc 1200 

Thr 

400 

aag 1248 
Lys 

aac 1296 
Asn 



acc 1344 
Thr 



cag 1392 
Gin 



gtc 1440 

Val 

480 

cag 1488 
Gin 
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tgc ctg taa 

Cys Leu 



1497 



<210> 80 

<211> 498 

<212> PRT 

<213> Trichoderma reesei 

<400> 80 

Met Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr 
1 5 10 15 

Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser 

20 25 30 

Val Val He Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser 
35 40 45 

Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
50 55 60 

Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala 
65 70 75 80 

Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser He Gly Phe 

85 90 95 

Val Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met 

100 105 110 

Ala Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe 
115 120 125 

Ser Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala 
130 135 140 

Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro 
145 150 155 160 

Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin 

165 170 175 

Cys Pro Arg Asp Leu Lys Phe He Asn Gly Gin Ala Asn Val Glu Gly 

180 185 190 
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Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly lie Gly Gly His Gly 
195 200 205 

Ser Cvs Cys S^r Glu Met Asp He Trp Glu Ala Asn Ser lie Ser Glu 
210 215 220 

Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu lie Cys Glu 
225 230 235 240 

Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr 

245 250 255 

Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr 

260 265 270 

Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys 
275 280 285 

Leu Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala lie Asn Arg Tyr 

290 295 300 

Tvr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly 
305 310 315 320 

Ser Tvr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu 

325 330 335 

Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin 

340 345 350 

Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp 
355 360 365 

Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr 
370 375 380 

Asn Glu Thr Ser Ser Thr Pro Gly Ala Val Arg Gly Ser Cys Ser Thr 
385 390 395 400 

Ser Ser Gly Val Pro Ala Gin Val Glu Ser Gin Ser Pro Asn Ala Lys 

4 05 4 10 4 15 
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Val Thr Phe Ser Asn lie Lys Phe Gly Pro He Gly Ser Thr Gly Asn 

420 425 430 

*ro S<=r Gly Gly Asn Pro Pro Gly Gly Asn Pro Pro Gly Thr Thr Thr 
435 ' 445 



Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser Pro Gly Pro Thr Gin 

4 55 460 



4 50 



Ser His Tyr Gly Gin Cys Gly Gly He Gly Tyr Ser Gly Pro Thr Val 



A 65 



470 



475 



Cys Ala Ser Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin 



435 



490 



Cys Leu 



<210> 81 

<211> 1365 

<212> DNA 

<2I3> Trichoderma reesei 



<220> 
<221> 
<222> 
<223> 



CDS 

(1) . . (1365) 

trichoderma reesei ce llobiohydrolase II 



<400> 81 „ aa 

atg gtg cct eta gag gag egg caa get tgc tea age gtc tgg ggc caa 

Met Val Pro Leu Glu Glu Arg Gin Ala Cys Ser Ser Val Trp Gly Gin 
1 5 io 15 

tgt ggt ggc cag aat tgg teg ggt ccg act tgc tgt get tec gga age 

Cys Gly Gly Gin Asn Trp Ser Gly Pro Thr Cys Cys Ala Ser Gly Ser 

20 25 30 

aca tgc gtc tac tee aae gac tat tac tec cag tgt ett ccc ggc get 

Thr Cys Val Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Gly Ala 

35 4° 45 

gca age tea age teg tec acg cgc gee gcg teg acg act tea cga gta 

Ala Ser Ser Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser Arg Val 
50 55 60 

tec ccc aca aca tec egg teg age tec gcg acg cct cea cct ggt tct 

Ser Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro Gly Ser 

66 70 75 80 

acc act ace aga gta cct cea gtc gga teg gga ace get acg tat tea 

Thr Thr Thr Arg Val Pro Pro Val Gly Ser Gly Thr Ala Thr Tyr Ser 



48 



96 



144 



192 



240 



288 
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85 90 95 

ggc aac cct ttt gtt ggg gtc act cct tgg gcc aat gca tat tac gcc 

Gly Asn Pro Phe Val Gly Val Thr Pro Trp Ala Asn Ala Tyr Tyr Ala 

100 105 HO 

tct gaa gtt age age etc get att cct age ttg act gga gcc atg gcc 

Ser Glu Val Ser Ser Leu Ala lie Pro Ser Leu Thr Gly Ala Met Ala 
115 120 125 

act get gca gca get gtc gca aag gtt ccc tct ttt atg tgg eta gat 

Thr Ala Ala Ala Ala Val Ala Lys Val Pro Ser Phe Met Trp Leu Asp 
130 135 1^0 

act ctt gac aag ace cct etc atg gag caa ace ttg gcc gac ate cgc 

Thr Leu Asp Lys Thr Pro Leu Met Glu Gin Thr Leu Ala Asp lie Arg 

150 155 160 



145 



ace gcc aac aag aat ggc ggt aac tat gcc gga cag ttt gtg gtg tat 
Thr Ala Asn Lys Asn Gly Gly Asn Tyr Ala Gly Gin Phe Val Val Tyr 

165 170 175 

gac ttg ccg gat cgc qat tgc get gcc ctt gcc teg aat ggc gaa tac 
Asp Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr 

180 185 190 

tct att gcc gat ggt ggc gtc gcc aaa tat aag aac tat ate gac ace 
Ser lie Ala Asp Gly Gly Val Ala Lys Tyr Lys Asn Tyr lie Asp Thr 
195 200 205 

att cgt caa att gtc gtg gaa tat tec gat ate egg ace etc ctg gtt 
He Arq Gin He Val Val Glu Tyr Ser Asp He Arg Thr Leu Leu Val 
210 215 220 

att gag cct gac tct ctt gcc aac ctg gtg ace aac etc ggt act cca 
lie Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn Leu Gly Thr Pro 
225 230 235 240 

aag tgt gcc aat get cag tea gcc tac ctt gag tgc ate aac tac gcc 
Lvs Cys Ala Asn Ala Gin Ser Ala Tyr Leu Glu Cys He Asn Tyr Ala 
y 245 250 255 

gtc aca cag ctg aac ctt cca aat gtt gcg atg tat ttg gac get ggc 
Val Thr Gin Leu Asn Leu Pro Asn Val Ala Met Tyr Leu Asp Ala Gly 



60 



265 270 



cat gca gga tgg ctt ggc tgg ccg gca aac caa gac ccg gcc get cag 

His Ala Gly Trp Leu Gly Trp Pro Ala Asn Gin Asp Pro Ala Ala Gin 

275 280 285 

eta ttt gca aat gtt tac aag aat gca teg tct ccg aga get ctt cgc 

Leu Phe Ala Asn Val Tyr Lys Asn Ala Ser Ser Pro Arg Ala Leu Arg 

290 295 300 

qqa ttg gca ace aat gtc gcc aac tac aac ggg tgg aac att acc age 

Glv Leu Ala Thr Asn Val Ala Asn Tyr Asn Gly Trp Asn He Thr Ser 

305 310 315 320 



336 



384 



4 32 



480 



528 



57 6 



624 



672 



720 



768 



816 



864 



912 



960 
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ccc cca teg tac acg caa ggc aac get gtc tac aac gag aag ctg tac 
Pro Pro Ser Tyr Thr Gin Gly Asn Ala Val Tyr Asn Glu Lys Leu Tyr 

325 330 335 



1008 



ate cac get att gga cct ctt ctt gee aat cac ggc tgg tec aac gee 
lie His Ala lie Gly Pro Leu Leu Ala Asn His Gly Trp Ser Asn Ala 

340 345 350 



ttc ttc ate act gat caa ggt cga teg gga aag cag cct ace gga cag 
Phe Phe lie Thr Asp Gin Gly Arg Ser Gly Lys Gin Pro Thr Gly Gin 
355 360 365 

caa cag tgg gga gac tgg tgc aat gtg ate ggc acc gga ttt ggt att 
Gin Gin Trp Gly Asp Trp Cys Asn Val lie Gly Thr Gly Phe Gly He 
370 375 380 

cgc cca tec gca aac act ggg gac teg ttg ctg gat teg ttt gtc tgg 
Arg Pro Ser Ala Asn Thr Gly Asp Ser Leu Leu Asp Ser Phe Val Trp 
385 390 395 400 

gtc aag cca ggc ggc gag tgt gac ggc acc age gac age agt gcg cca 
Val Lys Pro Gly Gly Glu Cys Asp Gly Thr Ser Asp Ser Ser Ala Pro 

405 410 415 

cga ttt gac tec cac tgt gcg etc cca gat gee ttg caa ccg gcg cct 
Arq Phe Asp Ser His Cys Ala Leu Pro Asp Ala Leu Gin Pro Ala Pro 

420 425 430 

caa get ggt get tgg ttc caa gee tac ttt gtg cag ctt etc aca aac 
Gin Ala Gly Ala Trp Phe Gin Ala Tyr Phe Val Gin Leu Leu Thr Asn 
435 440 445 

gca aac cca teg ttc ctg tag 
Ala Asn Pro Ser Phe Leu 
4 50 



<210> 82 

<211> 454 

<212> PRT 

<213> Trichoderma reesei 



400> 82 



Met Val Pro Leu Glu Glu Arg Gin Ala Cys Ser Ser Val Trp Gly Gin 
1 5 10 15 

Cys Gly Gly Gin Asn Trp Ser Gly Pro Thr Cys Cys Ala Ser Gly Ser 

20 25 30 

Thr Cys Val Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro Gly Ala 
35 40 45 

Ala Ser Ser Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser Arg Val 
50 55 60 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1365 
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Ser Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro Gly Ser 

65 ™ ^5 

Thr Thr Thr Arg Val Pro Pro Val Gly Ser Gly Thr Ala Thr Tyr Ser 

85 90 

Gly Asn Pro Phe Val Gly Val Thr Pro Trp Ala Asn Ala Tyr Tyr Ala 

100 105 110 

Ser Glu Val Ser Ser Leu Ala lie Pro Ser Leu Thr Gly Ala Met Ala 
115 120 125 

Thr Ala Ala Ala Ala Val Ala Lys Val Pro Ser Phe Met Trp Leu Asp 
130 135 

Thr Leu Asp Lys Thr Pro Leu Met Glu Gin Thr Leu Ala Asp lie Arg 

14 5 150 

Thr Ala Asn Lys Asn Gly Gly Asn Tyr Ala Gly Gin Phe Val Val Tyr 

165 I 70 

Asp Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr 

180 185 19° 

Ser lie Ala Asp Gly Gly Val Ala Lys Tyr Lys Asn Tyr lie Asp Thr 
195 200 205 

He Arg Gin lie Val Val Glu Tyr Ser Asp lie Arg Thr Leu Leu Val 
210 215 220 



He Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn Leu Gly Thr Pro 

225 



230 235 240 



Lys Cys Ala Asn Ala Gin Ser Ala Tyr Leu Glu Cys lie Asn Tyr Ala 

245 250 zdd 

Val Thr Gin Leu Asn Leu Pro Asn Val Ala Met Tyr Leu Asp Ala Gly 

260 265 270 

His Ala Gly Trp Leu Gly Trp Pro Ala Asn Gin Asp Pro Ala Ala Gin 
275 280 285 
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Leu Phe Ala Asn Val Tyr Lys Asn Ala Ser Ser Pro Arg Ala Leu Arg 
290 295 300 

Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Gly Trp Asn He Thr Ser 
305 310 315 320 

Pro Pro Ser Tyr Thr Gin Gly Asn Ala Val Tyr Asn Glu Lys Leu Tyr 

325 330 335 

He His Ala He Gly Pro Leu Leu Ala Asn His Gly Trp Ser Asn Ala 

340 345 350 

Phe Phe lie Thr Asp Gin Gly Arg Ser Gly Lys Gin Pro Thr Gly Gin 
355 360 365 

Gin Gin Trp Gly Asp Trp Cys Asn Val He Gly Thr Gly Phe Gly He 
370 375 380 

Arg Pro Ser Ala Asn Thr Gly Asp Ser Leu Leu Asp Ser Phe Val Trp 
385 390 395 400 

Val Lys Pro Gly Gly Glu Cys Asp Gly Thr Ser Asp Ser Ser Ala Pro 

405 410 415 

Arg Phe Asp Ser His Cys Ala Leu Pro Asp Ala Leu Gin Pro Ala Pro 

420 425 430 

Gin Ala Gly Ala Trp Phe Gin Ala Tyr Phe Val Gin Leu Leu Thr Asn 
435 ~ 440 445 



Ala Asn Pro Ser Phe Leu 
450 



<210> 83 

<211> 1317 

< 2 1 2 > DNA 

<213> Trichoderma reesei 



<220> 

<221> CDS 

<222> (1) . . (1317) 

<223> Trichoderma reesei endoglucanase I 
<400> 83 

atg cag caa ccg gga acc age acc ccc gag gtc cat ccc aag ttg aca 
Met Gin Gin Pro Gly Thr Ser Thr Pro Glu Val His Pro Lys Leu Thr 



48 
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1 5 io 15 

acc tac aag tgc aca aag tec ggg ggg tgc gtg gec cag gac acc teg 
Thr Tyr Lys Cys Thr Lys Ser Gly Gly Cys Val Ala Gin Asp Thr Ser 

20 25 

atq ate ctt gac tgg aac tac cgc tgg atg cac gac gca aac tac aac 
Val Val Leu Asp Trp Asn Tyr Arg Trp Met His Asp Ala Asn Tyr Asn 
35 40 45 

tea tac acc ate aac ggc ggc gtc aac acc acg etc tgc cct gac gag 
Ser Cys Thr Val Asn Gly Gly Val Asn Thr Thr Leu Cys Pro Asp Glu 
50 55 60 

aca acc tgt ggc aag aac tgc ttc ate gag ggc gtc gac tac gec gee 

Ala Thr Cys Gly Lys Asn Cys Phe He Glu Gly Val Asp Tyr Ala Ala 

65 ™ 75 

tcq ggc gtc acg acc teg ggc age age etc acc atg aac cag tac atg 

Ser Gly Val Thr Thr Ser Gly Ser Ser Leu Thr Met Asn Gin Tyr Met 

85 90 ^ 

crc aqc age tct ggc ggc tac age age gtc tct cct egg ctg tat etc 
Pro Ser Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro Arg Leu Tyr Leu 

100 1° 5 110 

ctg gac tct gac ggt gag tac gtg atg ctg aag etc aac ggc cag gag 
Leu Asp Ser Asp Gly Glu Tyr Val Met Leu Lys Leu Asn Gly Gin Glu 
115 120 125 

eta aac ttc gac gtc gac etc tct get ctg ccg tgt gga gag aac ggc 
Leu Ser Pne Asp Val Asp Leu Ser Ala Leu Pro Cys Gly Glu Asn Gly 
130 135 I 40 

tea etc tac ctg tct cag atg gac gag aac ggg ggc gec aac cag tat 
Hi Leu Tyr Leu Ser Gin Met Asp Glu Asn Gly Gly Ala Asn Gin Tyr 
145 150 155 ibu 

aac aca acc ggt gec aac tac ggg age ggc tac tgc gat get cag tgc 
Asn Thr Ala Gly Ala Asn Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys 

165 I 70 13 

ccc gtc cag aca tgg agg aac ggc acc etc aac act age cac cag ggc 
Pro Val Gin Thr Trp Arg Asn Gly Thr Leu Asn Thr Ser His Gin Gly 

180 185 I 90 

ttc tgc tgc aac gag atg gat ate ctg gag ggc aac teg agg gcg aat 
Phe Cys Cys Asn Glu Met Asp He Leu Glu Gly Asn Ser Arg Ala Asn 
195 200 205 

gec ttg acc cct cac tct tgc acg gee acg gec tgc gac tct gee ggt 
Ala Leu Thr Pro His Ser Cys Thr Ala Thr Ala Cys Asp Ser Ala Gly 
210 215 220 

tac aac ttc aac ccc tat ggc age ggc tac aaa age tac tac ggc ccc 
Cys Gly Phe Asn Pro Tyr Gly Ser Gly Tyr Lys Ser Tyr Tyr Gly Pro 
225 230 



96 



144 



192 



240 



288 
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384 



432 



480 



528 



576 



624 



672 



720 
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gga gat acc gtt gac acc tec aag acc ttc acc ate ate ace cag ttc 
Gly Asp Thr Val Asp Thr Ser Lys Thr Phe Thr lie lie Thr Gin Phe 

245 2 50 2oo 

aac acg gac aac ggc teg ccc teg ggc aac ctt gtg age ate acc cgc 
Asn Thr Asp Asn Gly Ser Pro Ser Gly Asn Leu Val Ser lie Thr Arg 

260 265 270 

aag tac cag caa aac ggc gtc gac ate ccc age gee cag ccc ggc ggc 
Lys Tyr Gin Gin Asn Gly Val Asp lie Pro Ser Ala Gin Pro Gly Gly 
275 280 285 

gac acc ate teg tec tgc ccg tec gee tea gee tac ggc ggc etc gee 
Asp Thr lie Ser Ser Cys Pro Ser Ala Ser Ala Tyr Gly Gly Leu Ala 
290 295 300 

acc atg ggc aag gee ctg age age ggc atg gtg etc gtg ttc age att 
Thr Met Gly Lys Ala Leu Ser Ser Gly Met Val Leu Val Phe Ser lie 
305 310 315 320 

tgg aac gac aac age cag tac atg aac tgg etc gac age ggc aac gee 

~ '• ' ' - Ser "' t „ , c,-:.v- iz 1 w tt = r. Ala 

325 



768 



tgg aac gac aac age cag tac atg aao uyy cc yov. 

Trp Asn Asp Asn Ser Gin Tyr Met Asn Trp Leu Asp Ser Gly Asn Ala 
"' ; 330 335 



ggc ccc tgc age age acc gag ggc aac cca tec aac acc ctg gec aac 

Glv Pro Cys Ser Ser Thr Glu Gly Asn Pro Ser Asn Thr Leu Ala Asn 

340 345 350 

aac ccc aac acg cac gtc gtc ttc tec aac ate cgc tgg gga gac att 

Asn Pro Asn Thr His Val Val Phe Ser Asn lie Arg Trp Gly Asp lie 

355 360 365 

ggg tct act acg aac teg act gcg ccc ccg ccc ccg cct gcg tec age 

Glv Ser Thr Thr Asn Ser Thr Ala Pro Pro Pro Pro Pro Ala Ser Ser 

370 375 380 

acg acg ttt teg act aca egg agg age teg acg act teg age age ccg 

Thr Thr Phe Ser Thr Thr Arg Arg Ser Ser Thr Thr Ser Ser Ser Pro 

385 390 395 400 

age tgc acg cag act cac tgg ggg cag tgc ggt ggc att ggg tac age 

Ser Cvs Thr Gin Thr His Trp Gly Gin Cys Gly Gly He Gly Tyr Ser 

' " J ■ - - 410 4 15 



405 



ggg tgc aag acg tgc acg teg ggc act acg tgc cag tat age aac gac 
Gly Cys Lys Thr Cys Thr Ser Gly Thr Thr Cys Gin Tyr Ser Asn Asp 

^ *1— 



420 

tac tac teg caa tgc ctt tag 
Tyr Tyr Ser Gin Cys Leu 
4 3 5 



<2i0> 84 

<2il> 438 

<212> PRT 

<213> Trichoderma reesei 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1317 
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<400> 84 



Met Gin Gin Pro Gly Thr Ser Thr Pro Glu Val His Pro Lys Leu Thr 
i r 10 15 



Thr Tyr Lys Cys Thr Lys Ser Gly Gly Cys Val Ala Gin Asp Thr Ser 

20 25 30 

Val Val Leu Asp Trp Asn Tyr Arg Trp Met His Asp Ala Asn Tyr Asn 
35 40 45 

Ser Cys Thr Val Asn Gly Gly Val Asn Thr Thr Leu Cys Pro Asp Glu 
50 55 60 

A^a Thr Cys Gly Lys Asn Cys Phe He Glu Gly Val Asp Tyr Ala Ala 
65 70 75 80 

Ser Glv Val Thr Thr Ser Gly Ser Ser Leu Thr Met Asn Gin Tyr Met 

85 90 95 

Pro Ser Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro Arg Leu Tyr Leu 

100 105 HO 

Leu Asp Ser Asp Gly Glu Tyr Val Met Leu Lys Leu Asn Gly Gin Glu 
115 120 125 

Leu Ser Phe Asp Val Asp Leu Ser Ala Leu Pro Cys Gly Glu Asn Gly 
130 135 140 

Ser Leu Tyr Leu Ser Gin Met Asp Glu Asn Gly Gly Ala Asn Gin Tyr 
145 150 155 160 

Asn Thr Ala Gly Ala Asn Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys 

165 170 175 

Pro Val Gin Thr Trp Arg Asn Gly Thr Leu Asn Thr Ser His Gin Gly 

180 185 190 

Phe Cys Cys Asn Glu Met Asp He Leu Glu Gly Asn Ser Arg Ala Asn 
195 200 205 

Ala Leu Thr Pro His Ser Cys Thr Ala Thr Ala Cys Asp Ser Ala Gly 
210 215 , 220 
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Cys Gly Phe Asn Pro Tyr Gly Ser Gly Tyr Lys Ser Tyr Tyr Gly Pro 

225 230 235 240 

Gly Asp Thr Val Asp Thr Ser Lys Thr Phe Thr He He Thr Gin Phe 

245 250 255 



Asn Thr Asp Asn Gly Ser Pro Ser Gly Asn Leu Val Ser He Thr Arg 

260 265 270 

Lys Tyr Gin Gin Asn Gly Val Asp He Pro Ser Ala Gin Pro Gly Gly 
275 280 285 

Asp Thr He Ser Ser Cys Pro Ser Ala Ser Ala Tyr Gly Gly Leu Ala 
290 295 300 

Thr Met Gly Lys Ala Leu Ser Ser Gly Met Val Leu Val Phe Ser He 
305 310 315 320 

Trp Asn Asp Asn Ser Gin Tyr Met Asn Trp Leu Asp Ser Gly Asn Ala 

325 330 335 



Gly Pro Cys Ser Ser Thr Glu Gly Asn Pro Ser Asn Thr Leu Ala Asn 

340 345 350 



Asn Pro Asn Thr His Val Val Phe Ser Asn He Arg Trp Gly Asp He 
355 360 365 

Gly Ser Thr Thr Asn Ser Thr Ala Pro Pro Pro Pro Pro Ala Ser Ser 
370 375 380 

Thr Thr Phe Ser Thr Thr Arg Arg Ser Ser Thr Thr Ser Ser Ser Pro 
385 390 395 400 

Ser Cys Thr Gin Thr His Trp Gly Gin Cys Gly Gly He Gly Tyr Ser 

405 410 415 



Gly Cys Lys Thr Cys Thr Ser Gly Thr Thr Cys Gin Tyr Ser Asn Asp 

420 425 430 



Tyr Tyr Ser Gin Cys Leu 
4 35 



<210> 85 
<211> 954 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 6GP1 



<220> 

<221> CDS 

<222> (1) . . (954 ) 

<223> 6GP1 



<400> 85 

atg ggc gtg gac ccg ttc gag cgc aac aag ate etc ggc cgc ggc ate 

Met Gly Vai Asp Pro Phe Glu Arg Asn Lys He Leu Gly Arg Gly lie 
15 10 15 

aac ate ggc aac gee ctg gag gee ccg aac gag ggc gac tgg ggc gtg 
Asn He Gly Asn Ala Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Val 

20 25 30 

gtg ate aag gac gag ttc ttc gac ate ate aag gag gee ggc ttc tec 
Val lie Lys Asp Glu Phe Phe Asp He He Lys Glu Ala Gly Phe Ser 
35 40 45 

cac gtg cgc ate ccg ate cgc tgg tec ace cac gee tac gee ttc ccg 
His Val Arg He Pro lie Arg Trp Ser Thr His Ala Tyr Ala Phe Pro 
50 55 60 

ccg tac aag ate atg gac cgc ttc ttc aag cgc gtg gac gag gtg ate 
Pro Tyr Lys He Met Asp Arg Phe Phe Lys Arg Val Asp Glu Val He 
65 70 75 80 

aac ggc gee etc aag cgc ggc etc gee gtg gee ate aac ate cac cac 
Asn Gly Ala Leu Lys Arg G.ly Leu Ala Val Ala He Asn lie His His 

85 90 95 

tac gag gag etc atg aac gac ccg gag gag cac aag gag cgc ttc etc 
Tyr Glu Glu Leu Met Asn Asp Pro Glu Glu His Lys Glu Arg Phe Leu 

100 105 110 

gee etc tgg aag cag ate gee gac cgc tac aag gac tac ccg gag ace 
Ala Leu Trp Lys Gin lie Ala Asp Arg Tyr Lys Asp Tyr Pro Glu Thr 
115 120 125 

etc ttc ttc gag ate etc aac gag ccg cac ggc aac etc ace ccg gag 
Leu Phe Phe Glu He Leu Asn Glu Pro His Gly Asn Leu Thr Pro Glu 
130 135 140 

aag tgg aac gag ctg etc gag gag gee etc aag gtg ate cgc tec ate 
Lys Trp Asn Glu Leu Leu Glu Glu Ala Leu Lys Val lie Arg Ser lie 
145 150 155 160 

gac aag aag cac ace ate ate att ggc ace gca gag tgg gga ggc ate 
Asp Lys Lys His Thr lie lie lie Gly Thr Ala Glu Trp Gly Gly lie 

165 170 175 

tec gee etc gag aag etc tec gtg ccg aag tgg gag aag aat tec ate 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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Ser Ala Leu Glu Lys Leu Ser Val Pro Lys Trp Glu Lys Asn- Ser lie 

180 185 190 

gtg acc ate cac tac tac aac ccg ttc gag ttc acg cac cag ggc gec 
Val Thr He His Tyr Tyr Asn Pro Phe Glu Phe Thr His Gin Gly Ala 
195 200 205 

gag tgg gtg gag ggc tec gag aag tgg ctt ggc cgc aag tgg ggc tec 
Glu Trp Val Glu Gly Ser Glu Lys Trp Leu Gly Arg Lys Trp Gly Ser 
210 215 220 

ccg gac gac cag aag cac etc ate gag gag ttc aac ttc ate gag gag 
Pro Asp Asp Gin Lys His Leu He Glu Glu Phe Asn Phe He Glu Glu 
225 230 235 240 

tgg tec aag aag aac aag cgc ccg ate tac ate ggc gag ttt ggc gee 
Trp Ser Lys Lys Asn Lys Arg Pro He Tyr He Gly Glu Phe Gly Ala 

24 5 250 255 



624 



672 



720 



768 



tac cgc aag gee gac etc gag tec cgc ate aag tgg acc tec ttc gtg 
Tyr Arg Lys Ala Asp Leu Glu Ser Arg He Lys Trp Thr Ser Phe Val 

260 265 270 



816 



gtg cgt gag atg gag aag cgc cgc tgg tec tgg gee tac tgg gag ttc 
Val Arg Glu Met Glu Lys Arg Arg Trp Ser Trp Ala Tyr Trp Glu Phe 
275 280 285 

tgc tec ggc ttc ggc gtg tac gac acc etc cgc aag acc tgg aac aag 
Cys Ser Gly Phe Gly Val Tyr Asp Thr Leu Arg Lys Thr Trp Asn Lys 
290 295 300 

gac etc etc gag gee etc ate ggc ggc gac tec ate gag tag 
Asp Leu Leu Glu Ala Leu lie Gly Gly Asp Ser lie Glu 



305 



310 



315 



864 



912 



954 



<210> 86 
<211> 317 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 
<400> 86 

Met Gly Val Asp Pro Phe Glu Arg Asn Lys lie Leu Gly Arg Gly He 
15 10 15 

Asn He Gly Asn Ala Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Val 

20 25 30 



Val lie Lys Asp Glu Phe Phe Asp lie lie Lys Glu Ala Gly Phe Ser 
3 5 4 0 4 5 
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His Val Arg lie 
50 



Pro Tyr Lys He 
65 



Asn Gly Ala Leu 



Tyr Glu Glu Leu 

100 



Ala Leu Trp Lys 
115 



Leu Phe Phe Glu 
130 



Lys Trp Asn Glu 
14 5 



Asp Lys Lys His 



Ser Ala Leu Glu 

180 



Val Thr He His 
195 



Glu Trp Val Glu 
210 



Pro Asp Asp Gin 
225 



Trp Ser Lys Lys 



Tyr Arg Lys Ala 

2 60 



Val Arg Glu Met 

27 5 



Pro He Arg Trp 
55 



Met Asp Arg Phe 
70 



Lys Arg Gly Leu 
85 



Met Asn Asp Pro 



Gin He Ala Asp 

120 



lie Leu Asn Glu 
135 



Leu Leu Glu Glu 
150 



Thr lie lie lie 
165 



Lys Leu Ser Val 



Tyr Tyr Asn Pro 

200 



Gly Ser Glu Lys 
215 



Lys His Leu He 
230 



Asn Lys Arg Pro 
2 4 5 



Asp Leu Glu Ser 



Glu Lys Arg Arg 

280 



Ser Thr His Ala 

60 



Phe Lys Arg Val 
75 



Ala Val Ala He 
90 



Glu Glu His Lys 
105 



Arg Tyr Lys Asp 



Pro His Gly Asn 

140 



Ala Leu Lys Val 
155 



Gly Thr Ala Glu 
170 



Pro Lys Trp Glu 
185 



Phe Glu Phe Thr 



Trp Leu Gly Arg 

220 



Glu Glu Phe Asn 
235 



He Tyr He Gly 
250 



Arg He Lys Trp 
265 



Trp Ser Trp Ala 



Tyr Ala Phe Pro 



Asp Glu Val He 

80 



Asn lie His His 
95 



Glu Arg Phe Leu 
110 



Tyr Pro Glu Thr 
125 



Leu Thr Pro Glu 



He Arg Ser lie 

160 



Trp Gly Gly He 
175 



Lys Asn Ser lie 
190 



His Gin Gly Ala 
205 



Lys Trp Gly Ser 



Phe He Glu Glu 

240 



Glu Phe Gly Ala 
255 



Thr Ser Phe Val 
270 



Tyr Trp Glu Phe 
285 
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Cys Ser Gly Phe Gly Val Tyr Asp Thr Leu Arg Lys Thr Trp Asn Lys 



290 



295 



300 



Asp Leu Leu Glu Ala Leu He Gly Gly Asp Ser He Glu 
305 310 315 



<210> 
<211> 
<212> 
<213> 



87 

1248 
DNA 

Horde urn vulgare 



<220> 
<221> 
<222> 
<223> 



CDS 

(1 ) . . (1248) 

Barley Amy I amylase 



<400> 87 

atg gca cac caa gtc etc ttt cag ggg ttc aac tgg gag teg tgg aag 

Met Ala His Gin Val Leu Phe Gin Gly Phe Asn Trp Glu Ser Trp Lys 
15 10 15 

cag age ggc ggg tgg tac aac atg atg atg ggc aag gtc gac gac ate 
Gin Ser Gly Gly Trp Tyr Asn Met Met Met Gly Lys Val Asp Asp He 

20 25 30 

gec get gee gga gtc ace cac gtc tgg ctg cca ccg ccg teg cac tec 
Ala Ala Ala Gly Val Thr His Val Trp Leu Pro Pro Pro Ser His Ser 
35 40 45 

gtc tec aac gaa ggt tac atg cct ggt egg ctg tac gac ate gac gcg 
Val Ser Asn Glu Gly Tyr Met Pro Gly Arg Leu Tyr Asp He Asp Ala 
50 55 60 

tec aag tac ggc aac gcg gcg gag etc aag teg etc ate ggc gcg etc 
Ser Lys Tyr Gly Asn Ala Ala Glu Leu Lys Ser Leu He Gly Ala Leu 
65 70 75 80 

cac ggc aag ggc gtg cag gee ate gee gac ate gtc ate aac cac cgc 
His Gly Lys Gly Val Gin Ala He Ala Asp He Val He Asn His Arg 

85 90 95 

tgc gee gac tac aag gat age cgc ggc ate tac tgc ate ttc gag ggc 
Cys Ala Asp Tyr Lys Asp Ser Arg Gly He Tyr Cys He Phe Glu Gly 

100 105 110 

ggc ace tec gac ggc cgc etc gac tgg ggc ccc cac atg ate tgt cgc 
Gly Thr Ser Asp Gly Arg Leu Asp Trp Gly Pro His Met He Cys Arg 
115 120 125 

gac gac ace aaa tac tec gat ggc ace gca aac etc gac ace gga gee 
Asp Asp Thr Lys Tyr Ser Asp Gly Thr Ala Asn Leu Asp Thr Gly Ala 
130 ^ 135 140 



48 



96 



144 



192 



240 



288 



336 



384 



432 
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gac ttc gcc gcc gcg ccc gac ate gac cac etc aac gac egg gtc cag 480 
Asp Phe Ala Ala Ala Pro Asp lie Asp His Leu Asn Asp Arg Val Gin 
145 150 155 160 

cgc gag etc aag gag tgg etc etc tgg etc aag age gac etc ggc ttc 528 
Arg Glu Leu Lys Glu Trp Leu Leu Trp Leu Lys Ser Asp Leu Gly Phe 

165 170 175 

gac gcg tgg cgc ctt gac ttc gcc agg ggc tac teg ccg gag atg gcc 576 
Asp Ala Trp Arg Leu Asp Phe Ala Arg Gly Tyr Ser Pro Glu Met Ala 

180 185 190 

aag gtg tac ate gac ggc aca tec ccg age etc gcc gtg gcc gag gtg 624 
Lys Val Tyr lie Asp Gly Thr Ser Pro Ser Leu Ala Val Ala Glu Val 
195 200 205 

tgg gac aat atg gcc ace ggc ggc gac ggc aag ccc aac tac gac cag 672 
Trp Asp Asn Met Ala Thr Gly Gly Asp Gly Lys Pro Asn Tyr Asp Gin 
210 215 220 

gac gcg cac egg cag aat ctg gtg aac tgg gtg gac aag gtg ggc ggc 720 
Asp Ala His Arg Gin Asn Leu Val Asn Trp Val Asp Lys Val Gly Gly 
225 230 235 240 

gcg gcc teg gca ggc atg gtg ttc gac ttc acg acc aaa ggg ata ctg 768 
Ala Ala Ser Ala Gly Met Val Phe Asp Phe Thr Thr Lys Gly lie Leu 

245 250 255 

aac get gcc gtg gag ggc gag ctg tgg agg ctg ate gac ccg cag ggg 816 
Asn Ala Ala Val Glu Gly Glu Leu Trp Arg Leu lie Asp Pro Gin Gly 

260 265 270 

aag gcc ccc ggc gtg atg gga tgg tgg ccg gcc aag gcc gtc acc ttc 864 
Lys Ala Pro Gly Val Met Gly Trp Trp Pro Ala Lys Ala Val Thr Phe 
275 280 285 

gtc gac aac cac gat aca ggc tec acg cag gcc atg tgg cca ttc ccc 912 
Val Asp Asn His Asp Thr Gly Ser Thr Gin Ala Met Trp Pro Phe Pro 
290 295 300 

tec gac aag gtc atg cag ggc tac gcg tac ate etc acc cac ccc ggc 960 
Ser Asp Lys Val Met Gin Gly Tyr Ala Tyr lie Leu Thr His Pro Gly 
305 310 315 320 

ate cca tgc ate ttc tac gac cat ttc ttc aac tgg ggg ttt aag gac 1008 
lie Pro Cys lie Phe Tyr Asp His Phe Phe Asn Trp Gly Phe Lys Asp 

325 330 335 

cag ate gcg gcg ctg gtg gcg ate agg aag cgc aac ggc ate acg gcg 1056 
Gin lie Ala Ala Leu Val Ala lie Arg Lys Arg Asn Gly lie Thr Ala 

340 345 350 

acg age get ctg aag ate etc atg cac gaa gga gat gcc tac gtc gcc 1104 
Thr Ser Ala Leu Lys lie Leu Met His Glu Gly Asp Ala Tyr Val Ala 
355 360 365 

gag ata gac ggc aag gtg gtg gtg aag ate ggg tec agg tac gac gtc 1152 
Glu He Asp Gly Lys Val Val Val Lys He Gly Ser Arg Tyr Asp Val 
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370 375 380 

999 gcg gtg ate ccg gee ggg ttc gtg ace teg gca cac ggc aac gac 1200 

Gly Ala Val He Pro Ala Gly Phe Val Thr Ser Ala His Gly Asn Asp 

385 390 395 400 



tac gee gtc tgg gag aag aac ggt gee gcg gca aca eta caa egg age 
Tyr Ala Val Trp Glu Lys Asn Gly Ala Ala Ala Thr Leu Gin Arg Ser 

405 ' 410 415 



<210> 88 
< 2 1 1 > 4 16 
<212> PRT 

<213> Horcleum vulgare 
<4 00> 8 8 

Met Ala His Gin Val Leu Phe Gin Gly Phe Asn Trp Glu Ser Trp Lys 
15 10 15 

Gin Ser Gly Gly Trp Tyr Asn Met Met Met Gly Lys Val Asp Asp He 

20 25 30 

Ala Ala Ala Gly Val Thr His Val Trp Leu Pro Pro Pro Ser His Ser 
35 40 45 

Val Ser Asn Glu Gly Tyr Met Pro Gly Arg Leu Tyr Asp He Asp Ala 
50 55 60 

Ser Lys Tyr Gly Asn Ala Ala Glu Leu Lys Ser Leu He Gly Ala Leu 
65 70 75 80 

His Gly Lys Gly Val Gin Ala He Ala Asp He Val He Asn His Arg 

8 5 90 95 



Cvs Ala Asp Tyr Lys Asp Ser Arg Gly He Tyr Cys lie Phe Glu Gly 

100 " 105 HO 



Gly Thr Ser Asp Gly Arg Leu Asp Trp Gly Pro His Met He Cys Arg 
115 120 125 

Asp Asp Thr Lys Tyr Ser Asp Gly Thr Ala Asn Leu Asp Thr Gly Ala 
130 135 140 

Asp Phe Ala Ala Ala Pro Asp He Asp His Leu Asn Asp Arg Val Gin 

145 150 155 160 



1248 
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Arg Glu Leu Lys Glu Trp Leu Leu Trp Leu Lys Ser Asp Leu Gly Phe 

165 170 175 

Asp Ala Trp Arg Leu Asp Phe Ala Arg Gly Tyr Ser Pro Glu Met Ala 

180 185 190 

Lys Val Tyr lie Asp Gly Thr Ser Pro Ser Leu Ala Val Ala Glu Val 
195 200 205 

Trp Asp Asn Met Ala Thr Gly Gly Asp Gly Lys Pro Asn Tyr Asp Gin 
210 215 220 

Asp Ala His Arg Gin Asn Leu Val Asn Trp Val Asp Lys Val Gly Gly 
225 230 235 240 

Ala Ala Ser Ala Gly Met Val Phe Asp Phe Thr Thr Lys Gly lie Leu 

245 250 255 

Asn Ala Ala Val Glu Gly Glu Leu Trp Arg Leu He Asp Pro Gin Gly 

260 265 270 

Lys Ala Pro Gly Val Met Gly Trp Trp Pro Ala Lys Ala Val Thr Phe 
275 280 285 

Val Asp Asn His Asp Thr Gly Ser Thr Gin Ala Met Trp Pro Phe Pro 
290 295 300 

Ser Asp Lys Val Met Gin Gly Tyr Ala Tyr He Leu Thr His Pro Gly 
305 310 315 320 

He Pro Cys He Phe Tyr Asp His Phe Phe Asn Trp Gly Phe Lys Asp 

325 330 335 

Gin He Ala Ala Leu Val Ala lie Arg Lys Arg Asn Gly He Thr Ala 

340 345 350 

Thr Ser Ala Leu Lys lie Leu Met His Glu Gly Asp Ala Tyr Val Ala 
355 360 365 

Glu He Asp Gly Lys Val Val Val Lys He Gly Ser Arg Tyr Asp Val 
370 375 380 

Gly Ala Val He Pro Ala Gly Phe Val Thr Ser Ala His Gly Asn Asp 
385 390 395 400 
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Tyr Ala Val Trp Glu Lys Asn Gly Ala Ala Ala Thr Leu Gin Arg Ser 

405 410 415 



<210> 89 

<211> 1401 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Trichoderma reesei fi-Glucosidase 2 



<220> 
<221> 
<222> 
<223> 



CDS 

{ 1 ) . ■ ( 1401 ) 

Trichoderma reesei fr-Glucos idase 2 



<400> 89 

atg ttg ccc aag gac ttt cag tgg ggg ttc gcc acg get gec tac cag 

Met Leu Pro Lys Asp Phe Gin Trp Gly Phe Ala Thr Ala Ala Tyr Gin 
15 10 15 

ate gag ggc gcc gtc gac cag gac ggc cgc ggc ccc age ate tgg gac 
lie Glu Gly Ala Val Asp Gin Asp Gly Arg Gly Pro Ser lie Trp Asp 

20 25 30 

acg ttc tgc gcg cag ccc ggc aag ate gcc gac ggc teg teg ggc gtg 
Thr Phe Cys Ala Gin Pro Gly Lys lie Ala Asp Gly Ser Ser Gly Val 
35 40 45 

acg gcg tgc gac teg tac aac cgc acg gcc gag gac att gcg ctg ctg 
Thr Ala Cys Asp Ser Tyr Asn Arg Thr Ala Glu Asp lie Ala Leu Leu 
50 55 60 

aag teg etc ggg gcc aag age tac cgc ttc tec ate teg tgg teg cgc 
Lys Ser Leu Gly Ala Lys Ser Tyr Arg Phe Ser He Ser Trp Ser Arg 
65 70 75 80 

ate ate ccc gag ggc ggc cgc ggc gat gcc gtc aac cag gcg ggc ate 
He He Pro Glu Gly Gly Arg Gly Asp Ala Val Asn Gin Ala Gly lie 

85 90 95 

gac cac tac gtc aag ttc gtc gac gac ctg etc gac gcc ggc ate acg 
Asp His Tyr Val Lys Phe Val Asp Asp Leu Leu Asp Ala Gly He Thr 

100 105 110 

ccc ttc ate acc etc ttc cac tgg gac ctg ccc gag ggc ctg cat cag 
Pro Phe He Thr Leu Phe His Trp Asp Leu Pro Glu Gly Leu His Gin 
115 120 125 

egg tac ggg ggg ctg ctg aac cgc acc gag ttc ccg etc gac ttt gaa 
Arg Tyr Gly Gly Leu Leu Asn Arg Thr Glu Phe Pro Leu Asp Phe Glu 
130 135 140 

aac tac gcc cgc gtc atg ttc agg gcg ctg ccc aag gtg cgc aac tgg 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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Asn Tyr Ala Arg Val Met Phe Arg Ala Leu Pro Lys Val Arg Asn Trp 
H5 150 155 160 

ate acc ttc aac gag ccg ctg tgc teg gec ate ccg ggc tac ggc tee 528 
lie Thr Phe Asn Glu Pro Leu Cys Ser Ala He Pro Gly Tyr Gly Ser 

165 170 175 

ggc acc ttc gec ccc ggc egg cag age acc teg gag ccg tgg acc gtc 576 
Gly Thr Phe Ala Pro Gly Arg Gin Ser Thr Ser Glu Pro Trp Thr Val 

180 185 190 

ggc cac aac ate etc gtc gee cac ggc cgc gec gtc aag gcg tac cgc 624 
Gly His Asn He Leu Val Ala His Gly Arg Ala Val Lys Ala Tyr Arg 
195 200 205 

gac gac ttc aag ccc gec age ggc gac ggc cag ate ggc ate gtc etc 672 
Asp Asp Phe Lys Pro Ala Ser Gly Asp Gly Gin He Gly He Val Leu 
210 215 220 

aac ggc gac ttc acc tac ccc tgg gac gec gee gac ccg gec gac aag 720 
Asn Gly Asp Phe Thr Tyr Pro Trp Asp Ala Ala Asp Pro Ala Asp Lys 
225 230 235 240 

gag gcg gee gag egg cgc etc gag ttc ttc acg gec tgg ttc gcg gac 768 
Glu Ala Ala Glu Arg Arg Leu Glu Phe Phe Thr Ala Trp Phe Ala Asp 

245 250 255 

ccc ate tac ttg ggc gac tac ccg gcg teg atg cgc aag cag ctg ggc 816 
Pro He Tyr Leu Gly Asp Tyr Pro Ala Ser Met Arg Lys Gin Leu Gly 

260 265 270 

gac egg ctg ccg acc ttt acg ccc gag gag cgc gee etc gtc cac ggc 864 
Asp Arg Leu Pro Thr Phe Thr Pro Glu Glu Arg Ala Leu Val His Gly 
275 280 285 

tec aac gac ttt tac ggc atg aac cac tac acg tec aac tac ate cgc 912 
Ser Asn Asp Phe Tyr Gly Met Asn His Tyr Thr Ser Asn Tyr He Arg 
290 295 300 

cac cgc age teg ccc gee tec gee gac gac acc gtc ggc aac gtc gac 960 
His Arg Ser Ser Pro Ala Ser Ala Asp Asp Thr Val Gly Asn Val Asp 
305 310 315 320 

gtg etc ttc acc aac aag cag ggc aac tgc ate ggc ccc gag acg cag 1008 
Val Leu Phe Thr Asn Lys Gin Gly Asn Cys He Gly Pro Glu Thr Gin 

325 330 335 

tec ccc tgg ctg cgc ccc tgt gee gee ggc ttc cgc gac ttc ctg gtg 1056 
Ser Pro Trp Leu Arg Pro Cys Ala Ala Gly Phe Arg A.sp Phe Leu Val 

340 345 350 

tgg ate age aag agg tac ggc tac ccg ccc ate tac gtg acg gag aac 1104 
Trp He Ser Lys Arg Tyr Gly Tyr Pro Pro He Tyr Val Thr Glu Asn 
355 360 365 

ggc acg age ate aag ggc gag age gac ttg ccc aag gag aag att etc 1152 
Gly Thr Ser He Lys Gly Glu Ser Asp Leu Pro Lys Glu Lys He Leu 
370 375 380 
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gaa gat gac ttc agg gtc aag tac tat aac gag tac ate cgt gec atg 
Glu Asp Asp Phe Arg Val Lys Tyr Tyr Asn Glu Tyr lie Arg Ala Met 



385 



390 



395 



400 



gtt acc gec gtg gag ctg gac ggg gtc aac gtc aag ggg tac ttt gec 
Val Thr Ala Val Glu Leu Asp Gly Val Asn Val Lys Gly Tyr Phe Ala 

405 410 415 

tgg teg etc atg gac aac ttt gag tgg gcg gac ggc tac gtg acg agg 
Trp Ser Leu Met Asp Asn Phe Glu Trp Ala Asp Gly Tyr Val Thr Arg 

420 425 430 

ttt ggg gtt acg tat gtg gat tat gag aat ggg cag aag egg ttc cc.c 
Phe Gly Val Thr Tyr Val Asp Tyr Glu Asn Gly Gin Lys Arg Phe Pro 
435 440 445 

aag aag age gca aag age ttg aag ccg ctg ttt gac gag ctg att gcg 
Lys Lys Ser Ala Lys Ser Leu Lys Pro Leu Phe Asp Glu Leu He Ala 
450 455 460 

gcg gcg tga 
Ala Ala 
465 



1200 



1248 



1296 



1344 



1392 



1401 



<21C> 
< 2 1 1 > 
<2I2> 
<213> 



90 
4 66 
PRT 

Artificial Sequence 



<220> 

<223> Synthetic Construct 
<400> 90 

Met Leu Pro Lys Asp Phe Gin Trp Gly Phe Ala Thr Ala Ala Tyr Gin 
15 10 15 

lie Glu Gly Ala Val Asp Gin Asp Gly Arg Gly Pro Ser He Trp Asp 

20 25 30 

Thr Phe Cys Ala Gin Pro Gly Lys He Ala Asp Gly Ser Ser Gly Val 
35 40 45 



Thr Ala Cys Asp Ser Tyr Asn Arg Thr Ala Glu Asp He Ala Leu Leu 
50 55 60 



Lys Ser Leu Gly Ala Lys Ser Tyr Arg Phe Ser He Ser Trp Ser Arg 



y 
65 



70 



75 



80 



He He Pro Glu Gly Gly Arg Gly Asp Ala Val Asn Gin Ala Gly He 

85 90 95 
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Asp His Tyr Val Lys Phe Val Asp Asp Leu Leu Asp Ala Gly He Thr 

100 105 HO 

Pro Phe lie Thr Leu Phe His Trp Asp Leu Pro Glu Gly Leu His Gin 
115 120 125 

Arg Tyr Gly Gly Leu Leu Asn Arg Thr Glu Phe Pro Leu Asp Phe Glu 
130 135 140 

Asn ^yr Ala Arg Val Met Phe Arg Ala Leu Pro Lys Val Arg Asn Trp 
145 150 155 160 

He Thr Phe Asn Glu Pro Leu Cys Ser Ala He Pro Gly Tyr Gly Ser 

165 HO 175 

Gly Thr Phe Ala Pro Gly Arg Gin Ser Thr Ser Glu Pro Trp Thr Val 

180 185 190 

Gly His Asn lie Leu Val Ala His Gly Arg Ala Val Lys Ala Tyr Arg 
195 200 205 

Asp Asp Phe Lys Pro Ala Ser Gly Asp Gly Gin He Gly He Val Leu 
210 215 220 

Asn Gly Asp Phe Thr Tyr Pro Trp Asp Ala Ala Asp Pro Ala Asp Lys 
225 230 235 240 

Glu Ala Ala Glu Arg Arg Leu Glu Phe Phe Thr Ala Trp Phe Ala Asp 

245 250 255 

Pro He Tyr Leu Gly Asp Tyr Pro Ala Ser Met Arg Lys Gin Leu Gly 

260 265 270 

Asp Arg Leu Pro Thr Phe Thr Pro Glu Glu Arg Ala Leu Val His Gly 
275 280 285 

Ser Asn Asp Phe Tyr Gly Met Asn His Tyr Thr Ser Asn Tyr He Arg 
290 295 300 

His Arg Ser Ser Pro Ala Ser Ala Asp Asp Thr Val Gly Asn Val Asp 
305 ' 310 315 320 
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Val Leu Phe Thr Asn Lys Gin Gly Asn Cys lie Gly Pro Glu Thr Gin 

325 330 335 

Ser Pro Trp Leu Arg Pro Cys Ala Ala Gly Phe Arg Asp Phe Leu Val 

340 345 350 

Trp lie Ser Lys Arg Tyr Gly Tyr Pro Pro He Tyr Val Thr Glu Asn 
355 360 365 

Gly Thr Ser He Lys Gly Glu Ser Asp Leu Pro Lys Glu Lys He Leu 
370 375 380 

Glu Asp Asp Phe Arg Val Lys Tyr Tyr Asn Glu Tyr He Arg Ala Met 
385 390 395 400 

Val Thr Ala Val Glu Leu Asp Gly Val Asn Val Lys Gly Tyr Phe Ala 

405 410 415 

Trp Ser Leu Met Asp Asn Phe Glu Trp Ala Asp Gly Tyr Val Thr Arg 

420 425 430 

Phe Gly Val Thr Tyr Val Asp Tyr Glu Asn Gly Gin Lys Arg Phe Pro 
435 440 445 

Lys Lys Ser Ala Lys Ser Leu Lys Pro Leu Phe Asp Glu Leu lie Ala 
450 455 460 



Ala Ala 
465 



<210> 91 

<211> 2103 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Trichoderma reesei fi-Glucosidase D 



<220> 

<221> CDS 

<222> (1) . . (2103) 

<223> Trichoderma reesei fi-Glucosidase D 
<400> 91 

atg att etc ggc tgt gaa age aca ggt gtc ate tct gec gtc aaa cac 
Met He Leu Gly Cys Glu Ser Thr Gly Val He Ser Ala Val Lys His 
15 10 15 



48 
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ttt gtc gcc aac gac cag gag cac gag egg cga gcg gtc gac tgt etc 
Ph^ Val Ala Asn Asp Gin Glu His Glu Arg Arg Ala Val Asp Cys Leu 

20 25 30 

ate ace cag egg get etc egg gag gtc tat ctg cga ccc ttc cag ate 
lie Thr Gin Arg Ala Leu Arg Glu Val Tyr Leu Arg Pro Phe Gin lie 
35 40 45 

gta gcc cga gat gca agg ccc ggc gca ttg atg aca tec tac aac aag 
Val Ala Arg Asp Ala Arg Pro Gly Ala Leu Met Thr Ser Tyr Asn Lys 
50 55 60 

gtc aat ggc aag cac gtc get gac age gcc gag ttc ctt cag ggc att 
Val Asn Gly Lys His Val Ala Asp Ser Ala Glu Phe Leu Gin Gly lie 
65 70 75 80 

etc egg act gag tgg aat tgg gac cct etc att gtc age gac tgg tac 
Leu Arg Thr Glu Trp Asn Trp Asp Pro Leu He Val Ser Asp Trp Tyr 

85 90 95 

ggc ace tac acc act att gat gcc ate aaa gcc ggc ctt gat etc gag 
Gly Thr Tyr Thr Thr He Asp Ala He Lys Ala Gly Leu Asp Leu Glu 

100 105 110 

atg ccg ggc gtt tea cga tat cgc ggc aaa tac ate gag tct get ctg 
Met Pro Gly Val Ser Arg Tyr Arg Gly Lys Tyr He Glu Ser Ala Leu 
115 120 125 

cag gcc cgt ttg ctg aag cag tec act ate gat gag cgc get cgc cgc 
Gin Ala Arg Leu Leu Lys Gin Ser Thr He Asp Glu Arg Ala Arg Arg 
130 135 140 

gtg etc agg ttc gcc cag aag gcc age cat etc aag gtc tec gag gta 
Val Leu Arg Phe Ala Gin Lys Ala Ser His Leu Lys Val Ser Glu Val 
145 150 155 160 

gag caa ggc cgt gac ttc cca gag gat cgc gtc etc aac cgt cag ate 
Glu Gin Gly Arg Asp Phe Pro Glu Asp Arg Val Leu Asn Arg Gin He 

165 170 175 

tgc ggc age age att gtc eta ctg aag aat gag aac tec ate tta cct 
Cys Gly Spr Ser He Val Leu Leu Lys Asn Glu Asn Ser He Leu Pro 

180 185 190 

etc ccc aag tec gtc aag aag gtc gcc ctt gtt ggt tec cac gtg cgt 
Leu Pro Lys Ser Val Lys Lys Val Ala Leu Val Gly Ser His Val Arg 
195 200 205 

eta ccg get ate teg gga gga ggc age gcc tct ctt gtc cct tac tat 
Leu Pro Ala lie Ser Gly Gly Gly Ser Ala Ser Leu Val Pro Tyr Tyr 
210 215 220 

gcc ata tct eta tac gat gcc gtc tct gag gta eta gcc ggt gcc acg 
Ala He Ser Leu Tyr Asp Ala Val Ser Glu Val Leu Ala Gly Ala Thr 
225 230 235 240 



96 



ate acg cac gag gtc ggt gcc tat gcc cac caa atg ctg ccc gtc 



14 4 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



ate 768 
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lie Thr His Glu Val Gly Ala Tyr Ala His Gin Met Leu Pro Val He 

245 ' 250 255 

gac gca atg ate age aac gee gta ate cac ttc tac aac gac ccc ate 
Asp Ala Met lie Ser Asn Ala Val He His Phe Tyr Asn Asp Pro lie 

260 265 270 

gat gtc aaa gac aga aag etc ett ggc agt gag aac gta teg teg aca 
Asp Val Lys Asp Arg Lys Leu Leu Gly Ser Glu Asn Val Ser Ser Thr 
275 280 285 

teg ttc cag etc atg gat tac aac aac ate cca acg etc aac aag gee 
Ser Phe Gin Leu Met Asp Tyr Asn Asn lie Pro Thr Leu Asn Lys Ala 

290 295 300 

ata ttc tgg ggt act etc gtg ggc gag ttt ate cct acc gee acg gga 
Met Phe Trp Gly Thr Leu Val Gly Glu Phe lie Pro Thr Ala Thr Gly 
305 310 315 320 

att tgg gaa ttt ggc etc agt gtc ttt ggc act gee gac ctt tat att 
lie Trp Glu Phe Gly Leu Ser Val Phe Gly Thr Ala Asp Leu Tyr He 

325 330 335 

qat aat gag etc gtg att gaa aat aca aca cat cag acg cgt gga acc 
Asp Asn Glu Leu Val lie Glu Asn Thr Thr His Gin Thr Arg Gly Thr 

340 345 350 

qcc ttt ttc gga aag gga acg acg gaa aaa gtc get acc agg agg atg 
Ala Phe Phe Gly Lys Gly Thr Thr Glu Lys Val Ala Thr Arg Arg Met 
355 360 365 

gtg gee ggc age acc tac aag ctg cgt etc gag ttt ggg tct gee aac 
Val Ala Gly Ser Thr Tyr Lys Leu Arg Leu Glu Phe Gly Ser Ala Asn 
370 375 380 

acg acc aag atg gag acg acc ggt gtt gtc aac ttt ggc ggc ggt gee 
Thr Thr Lys Met Glu Thr Thr Gly Val Val Asn Phe Gly Gly Gly Ala 
385 390 395 400 

gta cac ctg ggt gee tgt etc aag gtc gac cca cag gag atg att gcg 
Val His Leu Gly Ala Cys Leu Lys Val Asp Pro Gin Glu Met He Ala 

4 05 4 10 415 

cqq gee gtc aag gee gca gee gat gee gac tac acc ate ate tgc acg 
Arq Ala Val Lys Ala Ala Ala Asp Ala Asp Tyr Thr He He Cys Thr 

420 ^25 430 



gac ctg ccc cct ggt gtg gac acc atg ate teg caa gtt ctt gac gee 

Asp Leu Pro Pro Gly Val Asp Thr Met He Ser Gin Val Leu Asp Ala 

450 455 460 

qct ccc aat get gta gtc gtc aac cag tea ggc acc cca gtg aca atg 

Ala Pro Asn Ala Val Val Val Asn Gin Ser Gly Thr Pro Val Thr Met 

465 470 475 480 

Page 104 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



gga etc age ggc gag tgg gag tct gag ggt ttt gac egg cct cac atg 1344 
Gly Leu S^r Gly Glu Trp Glu Ser Glu Gly Phe Asp Arg Pro His Met 
435" 440 445 



1392 
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age tgg get cat aaa gca aag gec att gtg cag get tgg tat ggt ggt 

Ser Trp Ala His Lys Ala Lys Ala He Val Gin Ala Trp Tyr Gly Gly 

4 85 4 90 4 95 

aac gag aca ggc cac gga ate tec gat gtg etc ttt ggc aac gtc aac 

Asn Glu Thr Gly His Gly He Ser Asp Val Leu Phe Gly Asn Val Asn 

500 505 510 

ccg teg ggg aaa etc tec eta teg tgg cca gtc gat gtg aag cac aac 

Pro Ser Gly Lys Leu Ser Leu Ser Trp Pro Val Asp Val Lys His Asn 

515 520 525 

cca gca tat etc aac tac gee age gtt ggt gga egg gtc ttg tat ggc 

Pro Ala Tyr Leu Asn Tyr Ala Ser Val Gly Gly Arg Val Leu Tyr Gly 

530 535 540 

gag gat gtt tac gtt ggc tac aag ttc tac gac aaa acg gag agg gag 

Glu Asp Val Tyr Val Gly Tyr Lys Phe Tyr Asp Lys Thr Glu Arg Glu 

545 550 555 560 

gtt ctg ttt cct ttt ggg cat ggc ctg tct tac get acc ttc aag etc 

Val Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Ala Thr Phe Lys Leu 

565 570 575 

cca gat tct acc gtg agg acg gtc ccc gaa acc ttc cac ccg gac cag 

Pro Asp Ser Thr Val Arg Thr Val Pro Glu Thr Phe His Pro Asp Gin 

580 585 590 

ccc aca gta gee att gtc aag ate aag aac acg age agt gtc ccg ggc 

Pro Thr Val Ala He Val Lys He Lys Asn Thr Ser Ser Val Pro Gly 

595 600 605 

gec cag ate ctg cag tta tac att teg gee cca aac teg cct aca cat 

Ala Gin Val Leu Gin Leu Tyr He Ser Ala Pro Asn Ser Pro Thr His 

610 615 620 

cgc ccg gtc aag gag ctg cac gga ttc gaa aag gtg tat ctt gaa get 

Arg Pro Val Lys Glu Leu His Gly Phe Glu Lys Val Tyr Leu Glu Ala 

625 630 635 640 

ggc gag gag aag gag gta caa ata ccc att gac cag tac get act age 

Gly Glu Glu Lys Glu Val Gin He Pro lie Asp Gin Tyr Ala Thr Ser 

64 5 650 655 

ttc teg gac gag att gag age atg tgg aag age gag agg ggc att tat 

Phe Trp Asp Glu He Glu Ser Met Trp Lys Ser Glu Arg Gly He Tyr 

660 665 670 

gat gtg ctt gta gga ttc teg agt cag gaa ate teg ggc aag ggg aag 

Asp Val Leu Val Gly Phe Ser Ser Gin Glu He Ser Gly Lys Gly Lys 

675 680 685 

ctg att gtg cct gaa acg cga ttc tgg atg ggg ctg tag 

Leu He Val Pro Glu Thr Arg Phe Trp Met Gly Leu 

690 695 700 



1488 



1536 



158 4 



1632 



1680 



1728 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



2103 
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<210> 92 

<211> 700 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic Construct 

<400> 92 

Met He Lpu Gly Cys Glu Ser Thr Gly Val He Ser Ala Val Lys His 
1 5 10 15 

Phe Val Ala Asn Asp Gin Glu His Glu Arg Arg Ala Val Asp Cys Leu 

20 25 30 

He Thr Gin Arg Ala Leu Arg Glu Val Tyr Leu Arg Pro Phe Gin He 
35 "0 45 

Val Ala Arg Asp Ala Arg Pro Gly Ala Leu Met Thr Ser Tyr Asn Lys 
50 55 60 

Val Asn Gly Lys His Val Ala Asp Ser Ala Glu Phe Leu Gin Gly He 
65 70 75 80 

Leu Arg Thr Glu Trp Asn Trp Asp Pro Leu He Val Ser Asp Trp Tyr 

85 90 95 

Glv Thr Tyr Thr Thr He Asp Ala He Lys Ala Gly Leu Asp Leu Glu 

100 105 HO 

Met Pro Gly Val Ser Arg Tyr Arg Gly Lys Tyr He Glu Ser Ala Leu 
115 120 125 

Gin Ala Arg Leu Leu Lys Gin Ser Thr He Asp Glu Arg Ala Arg Arg 
130 135 140 

Val Leu Arg Phe Ala Gin Lys Ala Ser His Leu Lys Val Ser Glu Val 
145 150 155 160 

Glu Gin Gly Arg Asp Phe Pro Glu Asp Arg Val Leu Asn Arg Gin He 

165 170 175 

Cvs Glv Ser Ser He Val Leu Leu Lys Asn Glu Asn Ser He Leu Pro 

180 185 190 
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Leu Pro Lys Ser Val Lys Lys Val Ala Leu Val Gly Ser His Val Arg 
195 200 205 

Leu Pro Ala lie Ser Gly Gly Gly Ser Ala Ser Leu Val Pro Tyr Tyr 
210 215 220 

Ala Hp Ser Leu Tyr Asp Ala Val Ser Glu Val Leu Ala Gly Ala Thr 
225 " 230 235 240 

lie Thr His Glu Val Gly Ala Tyr Ala His Gin Met Leu Pro Val He 

245 250 255 

Asp Ala Met He Ser Asn Ala Val He His Phe Tyr Asn Asp Pro He 

260 265 270 

Asp Val Lys Asp Arg Lys Leu Leu Gly Ser Glu Asn Val Ser Ser Thr 
275 280 285 

q er ph^ Gin Leu Met Asp Tyr Asn Asn He Pro Thr Leu Asn Lys Ala 
290 295 300 

Met Phe Trp Gly Thr Leu Val Gly Glu Phe He Pro Thr Ala Thr Gly 
305 310 315 320 

lie Trp Glu Phe Gly Leu Ser Val Phe Gly Thr Ala Asp Leu Tyr He 

325 330 335 

Asp Asn Glu Leu Val He Glu Asn Thr Thr His Gin Thr Arg Gly Thr 

340 345 350 

Ala Phe Phe Gly Lys Gly Thr Thr Glu Lys Val Ala Thr Arg Arg Met 
355 360 365 

Val Aia Gly Ser Thr Tyr Lys Leu Arg Leu Glu Phe Gly Ser Ala Asn 

370 375 380 

Thr Thr Lys Met Glu Thr Thr Gly Val Val Asn Phe Gly Gly Gly Ala 
385 390 395 400 

Val His Leu Gly Ala Cys Leu Lys Val Asp Pro Gin Glu Met He Ala 

405 410 415 

Ara Ala Val Lys Ala Ala Ala Asp Ala Asp Tyr Thr He He Cys Thr 

420 425 430 
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Gly Leu Ser Gly Glu Trp Glu Ser Glu Gly Phe Asp Arg Pro His Met 
435 440 445 

Asp Leu Pro Pro Gly Val Asp Thr Met lie Ser Gin Val Leu Asp Ala 
450 455 460 

Ala Pro Asn Ala Val Val Val Asn Gin Ser Gly Thr Pro Val Thr Met 
465 470 475 480 

Ser Trp Ala His Lys Ala Lys Ala He Val Gin Ala Trp Tyr Gly Gly 

485 490 4 95 

Asn Glu Thr Gly His Gly lie Ser Asp Val Leu Phe Gly Asn Val Asn 

500 505 510 

Pro Ser Gly Lys Leu Ser Leu Ser Trp Pro Val Asp Val Lys His Asn 
515 520 525 

Pro Ala Tyr Leu Asn Tyr Ala Ser Val Gly Gly Arg Val Leu Tyr Gly 
530 535 540 

Glu Asp Val Tyr Val Gly Tyr Lys Phe Tyr Asp Lys Thr Glu Arg Glu 
545 550 555 560 

Val Leu Phe Pro Phe Gly His Gly Leu Ser Tyr Ala Thr Phe Lys Leu 

565 570 575 

Pro Asp Ser Thr Val Arg Thr Val Pro Glu Thr Phe His Pro Asp Gin 

580 585 590 

Pro Thr Val Ala He Val Lys He Lys Asn Thr Ser Ser Val Pro Gly 
595 600 605 

Ala Gin Val Leu Gin Leu Tyr He Ser Ala Pro Asn Ser Pro Thr His 
610 615 620 

Arq Pro Val Lys Glu Leu His Gly Phe Glu Lys Val Tyr Leu Glu Ala 
625 630 635 640 

Glv Glu Glu Lys Glu Val Gin He Pro lie Asp Gin Tyr Ala Thr Ser 

64 5 650 655 
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Phe Trp Asp Glu He Glu Ser Met Trp Lys Ser Glu Arg Gly-Ile Tyr 

660 665 670 

Asp Val Leu Val Gly Phe Ser Ser Gin Glu He Ser Gly Lys Gly Lys 
675 680 685 



Leu He Val Pro Glu Thr Arg Phe Trp Met Gly Leu 
690 695 ^00 



<210> 93 

<211> 1496 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Maize optimized CBHI 



<400> 93 
tgcag t ccgc 


ctgcaccctc 


cagt ccgaga 


cccacccgcc 


gctcacctgg 


cagaagtget 


60 


cctccggcgg 


cacctgcacc 


cagcagaccg 


gctccgtggt 


ga tcgacgcc 


aact ggeget 


120 


ggacccacgc 


caccaactcc 


t ccaccaact 


getacgaegg 


caacacctgg 


tcct ccaccc 


180 


t ctgcccgga 


caacgagacc 


tgegecaaga 


act getgect 


cgacggcgcc 


gcctacgcct 


240 


ccacct acgg 


cgtgaccacc 


t ccggcaact 


ccctct ccat 


egget tegtg 


acccagt ccg 


300 


cccagaagaa 


cgtgggcgcc 


cgect ctacc 


t ca tggcetc 


cgacaccacc 


taccaggagt 


360 


t caccct cct 


eggcaacgag 


tt ctcct teg 


acgtggacgt 


gt cccagct c 


ccgt geggee 


420 


t caacggcgc 


cct ct act t c 


gtgt cca tgg 


acgccgacgg 


cggcgtgtcc 


aagt acccga 


480 


ccaacaccgc 


cggcgccaag 


tacggcaccg 


getactgega 


ctcccagtgc 


ccgcgcgacc 


540 


tcaagt t cat 


caacggccag 


gccaacgtgg 


agggctggga 


gccgtcctcc 


aacaacgcca 


600 


acaccggca t 


cggcggccac 


ggctcctgct 


gctccgaga t 


ggacatctgg 


gaggecaact 


660 


cca tctccga 


ggcect cacc 


ccgcacccg t 


gcaccaccgt 


gggecaggag 


at ct gcgagg 


720 


gcgacggct g 


cggcggcacc 


tactccgaca 


accgctacgg 


cggcacct gc 


gacccggacg 


780 


gctgcgact g 


gaacccgtac 


cgect eggea 


acacctcc 1 1 


ctacggcccg 


ggctcct cct 


840 


t caccctcga 


caccaccaag 


aagctcaccg 


t ggtgaccca 


gt tcgagacc 


t ccggcgcca 


900 


tcaaccgct a 


ct acgtgcag 


aacggcgtga 


cct tccagca 


gccgaacgcc 


gagctegget 


960 


cctact ccgg 


caacgagct c 


aacgacgact 


act gcaccgc 


egaggaggee 


gagttcggcg 


1020 


get cct cct t 


ct ccgacaag 


ggcggcctca 


cccag t tcaa 


gaaggccacc 


tccggcggca 


1080 


tggt getegt 


ga tgtccctc 


t gggacgact 


act aegecaa 


catget ctgg 


ct cgactcca 


1140 
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CCLaCCCydL. 




t cct cca ccc 


cqqqcqccg t 


qcqcggctcc 


t gctccacct 


1200 


cctccggcgt 


gccggcccag 


g t ggagtccc 


agt ccccgaa 


cgccaaggug 




12 60 

1 t. V w 


aca t caagt t 


cggcccga t c 


ggct ccaccg 


gcaacccgtc 


cggcggcaac 


ccgccgggcg 


1320 


gcaacccgcc 


gggcaccacc 


accacccgcc 


gcccggccac 


caccaccggc 


tcctccccgg 


1380 


qcccqaccca 


gtcccact ac 


ggccagtgcg 


gcggcat egg 


ctact cegge 


ccgaccgtgt 


1440 


gcgcc tccgg 


caccacct gc 


caggtgct ca 


acccgtact a 


ct cccagtgc 


ct ctag 


1496 



<210> 94 

<211> 1365 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Maize optimized CBHII 



<400> 94 



v 4 U U / -/ ^ 

atqqtqccgc 


t egaggageg 


ccaggcctgc 


tcctccgtg t 


ggggccagtg 


cggcggccag 


60 


aactqg tccg 


gcccgacctg 


ctgcgcct cc 


ggctccacct 


gcgtgt act c 


caacgact ac 


120 


LdCLCCCdy L 


y ll i_ ^— ^ \) \) \) 


caccacct cc 


tcctcct cct 


ccacccqcqc 


cgect ccacc 


180 


acct cccgcg 


t gtccccgac 


cacctcccgc 


t cctcctccg 


ccaccccgcc 


geeggget cc 


240 


accaccaccc 


gcgt gccgcc 


ggt gggctcc 


ggcaccgcca 


cctact cegg 


caacccgt tc 


300 


gtgggcg tga 


ccccg tgggc 


caacgcctac 


tacgect ccg 


aggtgt cctc 


cct cgccatc 


360 


ccgt ccct ca 


ccggcgcca t 


ggccaccgcc 


gccgccgccg 


tggccaaggt 


geegt cct t c 


420 


a tgtggct eg 


acaccct cga 


caagaccccg 


ctcatggagc 


agaccct cgc 


cgacatccgc 


480 


accqccaaca 


agaaeggegg 


caactacgcc 


ggccagt teg 


tggtgtacga 


cctcccggac 


540 


cgcgactgcg 


ccgccctcgc 


ctccaacggc 


gag tact cca 


tcgccgacgg 


cggcgtggcc 


600 


aag t acaaqa 


act acatcga 


caccatccgc 


cagatcgtgg 


tggagt act c 


cgacatccgc 


660 


accctcct eg 


tga tegagee 


ggact ccct c 


gccaacc teg 


tgaccaacct 


cggcaccccg 


720 


aagtgcgcca 


acgcccagt c 


cgect acctc 


gagt gca t ca 


actacgccgt 


gacccagct c 


780 


aacct cccga 


aegt ggeca t 


gtacctcgac 


gccggccacg 


ccggctggct 


egget ggccg 


840 


gccaaccagg 


acccggccgc 


ccagct ct tc 


gccaacgtgt 


acaagaaege 


ctcct ccccq 


900 


cgcgccctcc 


geggect cgc 


caccaacgtg 


gccaactaca 


acggctggaa 


ca tcacctcc 


960 


ccgccg tcct 


acacccaggg 


caacgccgtg 


t acaacgaga 


agctctaca t 


ccacqcca t c 


1020 


ggcccgct cc 


t cgccaacca 


cggctggt cc 


aacgect tct 


tcatcaccga 


ccagggccgc 


1080 
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t ccggcaaqc 


agccgaccgg 


ccagcagcag 


ggct tcggca 


t ccgcccgtc 


cgccaacacc 


gt gaagccgg 


gcggcgagt g 


cgacggcacc 




t cccggacgc 


cct ccagccg 


t act tcgtgc 


agct cctcac 


caacgccaac 


<210> 95 
<211> 131" 
<212> DNA 
<213> Arti 


.ficial Sequence 


<220> 

<223> Maize optimized EGLI 


<400> 95 
at gcagcagc 


cgggcacct c 


caccccggag 


accaagt ccg 


gegget gcgt 


ggcccaggac 


tgga tgcacg 


acgccaacta 


CaaCLCCtyL 


tgcccggacg 


aggccacct g 


/— » /-f -\ -3 <-| 3 T p 

CyyCadyddL 


t ccggcgt ga 


ccacctccgg 




ggcggctact 


cctccgtgtc 


cccgcgcct c 


a t gctcaagc 


t caacggcca 


ggagctctcc 


ggcgagaacg 


gctccct ct a 


cctct cccag 


aacaccgccg 


gcgccaacta 


egget cegge 


tggcgcaacg 


gcaccct caa 


cacct cccac 


ct cgagggca 


act cccgcgc 


caacgccct c 


gact ccgccg 


getgegget t 


caacccgtac 


ggcgacaccg 


t ggacacctc 


caagacct tc 


ggct ccccg t 


ccggcaacct 


cgt gt ccatc 


a tcccgtccg 


cccagccggg 


cggcgacacc 


ggcggcct eg 


ccaccatggg 


caaggccct c 


t ggaacgaca 


actcccagta 


ca tgaactgg 


tccaccgagg 


gcaacccg tc 


caacaccctc 


t ccaaca tec 


getggggega 


cat cggctcc 



uggggcgacu 


yy tgcaaoy l 




114 0 


ggegact ccc 


tcctcgactc 


ct t eg tgtgg 


1200 


tccgact cct 


ccgccccgcg 


ct tcgact cc 


1260 


gccccgcagg 


ccggcgcctg 


gt tccaggcc 


1320 


ccgtcct tec 


tct ag 




13'65 



g tgcacccga 


agctcaccac 


ctacaagtgc 


60 


acct ccgtgg 


tgetcgae tg 


gaactaccgc 


120 


accgt gaacg 


gcggcgtgaa 


caccaccctc 


180 


tgettea teg 


agggegt gga 


ctacgccgcc 


240 


O *w \ O l. y a a ^» v — 


aatacatacc 


a t cctcct cc 


300 


tacct cctcg 


act ccgaegg 


egagtaegt g 


360 


1 1 cgacgtgg 


acct ct cege 


cct cccgtgc 


420 


at ggacgaga 


acggcggcgc 


caaccagtac 


480 


t actgegacg 


cccagtgccc 


ggtgcagacc 


540 


cagggct tct 


getgeaaega 


gatggaca t c 


600 


accccgcact 


cctgcaccgc 


caccgcctgc 


660 


ggct cegget 


acaagtccta 


ctacggcccg 


720 


accat ca t ca 


cccagt tcaa 


caccgacaac 


780 


acccgcaagt 


accagcagaa 


eggegt gga c 


840 


a t ctcctcct 


gcccgtccgc 


ctccgcctac 


900 


t cctccggca 


tggtgct cgt 


gttctccat c 


960 


ctcgactccg 


gcaacgccgg 


cccgtgctcc 


1020 


gccaacaacc 


cgaacaccca 


cgtggtgttc 


1080 


accaccaact 


ccaccgcccc 


gccgccgccg 


1140 
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ccggcct cct 


ccacca cct t 


ctccaccacc 


cgccgctcct 


ccaccacct c 


ctcct ccccq 


1200 


t cctgcaccc 


agacccact g 


gggccag tgc 


ggeggcateg 


get actcegg 


ctgcaagacc 


1260 


t gcacc tccg 


gcaccacctg 


ccagtact cc 


aacgact ac t 


actcccagtg 


cctctag 


1317 


<210> 96 
<211> 1401 
<212> DNA 

<213> Artificial Sequence 








• 


<220> 

<22 3> Mai z 


:e optimizec 


1 BGLII 










<400> 96 
a tgctcccga 


aggact t cca 


gtggggctt c 


gccaccgccg 


cctaccaga t 


cgagggcgcc 


60 


gt ggaccagg 


acggccgcgg 


cccgtcca tc 


t gggacacct 


tctgcgccca 


geegggcaag 


120 


atcgccgacg 


gctcctccgg 


cgt gaccgcc 


tgcgactcct 


acaaccgcac 


cgccgaggac 


180 


a t cgccct cc 


tcaagt ccct 


cggcgccaag 


tcctaccgct 


tctccat etc 


ctgg t cccgc 


240 


a t ca tcccgg 


agggeggecg 


cggcgacgcc 


gt gaaccagg 


ceggcat cga 


ccactacgtg 


300 


aagt tcgtgg 


acgacc t cc t 


cgacgccggc 


at caccccgt 


teat caccct 


ct tccactgg 


360 


gacct cccgg 


agggect cca 


ccagcgctac 


ggcggcctcc 


t caaccgcac 


cgagt tcccg 


420 


ct cgac 1 1 eg 


agaactaege 


ccgcgtga tg 


tt ccgcgccc 


tcccgaaggt 


gcgcaactgg 


480 


a tcaccttca 


acgagccgct 


ctget ccgcc 


a tcccgggct 


aegget cegg 


caccttcgcc 


540 


ccgggccgcc 


ag tccacct c 


cgagccgtgg 


accgtgggcc 


acaacat cct 


eg t ggcccac 


600 


aaccacaccq 

\^ *«_«■ X— * »w ^-^ 


tqaaqgecta 


ccgcgacgac 


t teaagcegg 


cctccggcga 


cggccagatc 


660 


ggcat eg tgc 


teaaeggega 


ct t cacctac 


ccgtgggacg 


ccgccgaccc 


ggccgacaag 


720 


gaggccgccg 


agcgccgcc t 


cgagt tct t c 


accgcctgg t 


tcgccgaccc 


ga tctacct c 


780 


ggegae t acc 


eggect cca t 


gcgcaagcag 


ctcggcgacc 


gcctcccgac 


ctt caccccg 


840 


gaggagegeg 


ccctcgt gca 


cggctccaac 


gacttctacg 


gca tgaacca 


ct acacctcc 


900 


aact acat cc 


gccaccgctc 


ctccccggcc 


tccgccgacg 


acaccgtggg 


caacgtggac 


960 


gtget ct tea 


ccaacaaqca 


gggcaact gc 


a t coacccqq 


aqacccagtc 


cccg tggct c 


1020 


cgcccgtgcg 


ccgccggct t 


ccgcgact t c 


ctcgtgtgga 


tctccaagcg 


ctacggctac 


1080 


ccgccga tct 


aegtgacega 


gaacggcacc 


t ccatcaagg 


gegagtcega 


cctcccgaag 


1140 


gagaaga tec 


t cgaggacga 


ct tecgegtg 


aag tactaca 


acgagtaca t 


ccgcgccat g 


1200 


gtgaccgccg 


tggagct cga 


eggegt gaac 


gtgaagggct 


acttcgcctg 


gtccctcatg 


1260 
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gacaact teg 


ag tgggcega 


eggctaegtg 


acccgct teg 


gcgtgacct a 


cgtggact ac 


1320 


gagaaeggee 


agaageget t 


cccgaagaag 


t ccgccaagt 


ccctcaagcc 


get ct t cgac 


1380 


gacctcatcg 


ccgccgccta 


9 








1401 


<210> 97 
<211> 2103 
<212> DNA 

<213> Artificial Sequence 








- 


<220> 

<223> Maize optimizec 


1 CEL3D 










<400> 97 
atgat cctcg 


get gcgagt c 


caccggcgtg 


at ctccgccg 


tgaagcact t 


cgtggccaac 


60 


gaccaggagc 


acgagcgccg 


cgccgtggac 


tgcctcatca 


cccagcgcgc 


cctccgcgag 


120 


g t gt acct cc 


gecegt t cca 


ga tcgtggcc 


cgcgacgccc 


gcccgggcgc 


cct ca t gacc 


180 


tcctacaaca 


aggtgaaegg 


caagcaegtg 


gecgact ccg 


ccgagt tcct 


ccagggca tc 


240 


c tccgcaccg 


agt ggaactg 


ggacccgct c 


atcgtgtccg 


actggtaegg 


cacct acacc 


300 


acca t cgacg 


cca tcaaggc 


eggect cgac 


etcgagatge 


cgggcgtgtc 


ccgctaccgc 


360 


gcj.caagt aca 


t cgagtccgc 


cct ccaggcc 


cgcctcctca 


agcagt ccac 


ca tcgacgag 


420 


cgcgcccgcc 


gcgtgct ccg 


ct t cgcccag 


aaggect ccc 


acctcaaggt 


gt ccgaggtg 


480 


gagcagggee 


gegae t tccc 


ggaggacege 


gt get caacc 


gecaga tctg 


cggctcctcc 


540 


atcgtgctcc 


tcaagaacga 


gaactccatc 


ct cccgctcc 


cgaag t ccgt 


gaagaaggt g 


600 


geect eg tgg 


gctcccacgt 


nrnrrt" rrrn 


gecat ct ccg 


geggegget c 


cgect ccct c 


660 


gtgccg tact 


acgccatctc 


cc t ctacgac 


gccgtgt ccg 


aggtgetege 


cggcgccacc 


720 


a t cacccacg 


aggtgggcgc 


ct a cgcccac 


cagat gctcc 


cggtga tcga 


cgccatga tc 


780 


t ccaacgccg 


tgat ccact t 


ctacaacgac 


ccga tcgacg 


tgaaggaccg 


caagct cc t c 


840 


ggct ccgaga 


acg tgt cctc 


cacctcct t c 


cagctcatgg 


act acaacaa 


ca tcccgacc 


900 


ct caacaagg 


ccatgt tctg 


gggcaccctc 


gtgggcgagt 


tea tcccgac 


cgccaccggc 


960 


a tctgggagt 


t cggcctctc 


cgtgtt egge 


accqccgacc 


tctacatcga 


caacgagctc 


1020 


gtgat cgaga 


acaccaccca 


ccagacccgc 


ggcaccgcct 


tct teggcaa 


gggcaccacc 


1080 


gagaaggtgg 


ccacccgccg 


catggtggcc 


ggctccacct 


acaagctccg 


cctcgagtt c 


1140 


ggct ccgcca 


acaccaccaa 


ga tggagacc 


accggcgtgg 


tgaact t egg 


cggcggcgcc 


1200 


g t gcacct eg 


gegect gect 


caagg tggac 


cegcaggaga 


t gatcgcccg 


cgccgt gaag 


1260 
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QCCQCCQCCQ 

\M N_- >w> 


acaccoa eta 


caeca tea tc 


aaaaart tea 


accacccaca 


ca tggacct c 




ccaccccaaa 


cqccqt qq tg 


s_ v_ — L \J U ^ \— 


SCaaOGCCaa 


oqccatcqtq 


^ C* V V*J \_J V^> Ot L- ^ V. 


ccaacatoct 


ct teggcaac 




sratoaaaca 

C* v^* ^— ^-j ^ v -' ^ 


caacccqqcc 




acaaaaacat 


qtacqt qqqc 




cat t caa cca 


cggcctctcc 


nt a n n p Pi c ra 


t accaaa aa c 


ct t ccacccg 


t-l N-J t-4 V— * n— - W 


cc t ccqtqcc 


qqgcgcccag 


t ccccgaccc 


accgcccggt 


gaaggagctc 


ggegaggaga 


aggaggt gca 


gatcccga t c 


a t cgagt cca 


tgtggaag tc 


cgagcgcggc 


caggaga t ct 


ccggcaaggg 


caagct ca tc 


tag 






<210> 98 
<211> 420 
<212> DNA 
<213> Zea 


mays 





<220> 

<223> Q protein promoter 
<400> 98 

gggctggtaa attacttggg agcaatggta 
ctagttgtca caagttgtat ategattegt 
acaagtaaca caatattaca aagttagttt 
tgacatgtaa agtgaagctt attatacttc 
acaaaggtcc aaaaatccac atcaaccatt 
attatccaac aaatgtttat caatgtggta 
agaaattgtg ccaacaaagc tataagtaac 

<210> 99 
<211> 1188 



tgcaccggcc 


tct ceggega 


gt gggagtcc 


1320 


ccgccgggcg 


tggacaccat 


gat ctcccag 


1380 


gtgaaccag t 


ccggcacccc 


ggtgaccatg 


1440 


caggect gg t 


aeggeggcaa 


cgagaccggc 


1500 


gtgaacccgt 


ccggcaagct 


ct ccctctcc 


15 60 


t acct caact 


acqcctccqt 


gggcggccgc 


1620 


t acaagttct 


acgacaagac 


egagegegag 


1680 


t acgccacct 


tcaagct ccc 


ggactccacc 


1740 


qaccagccga 


ccgt ggecat 


cgtgaagatc 


1800 


gt gctccagc 


tctacatctc 


cgccccgaac 


1860 


caegget teg 


agaaggtgta 


cctcgaggcc 


1920 


gaccagt acg 


ccacctcct t 


ctgggacgag 


1980 


atctacgacg 


tgctcgtggg 


ct t ctcct cc 


2040 


gt geeggaga 


cccgct tctg 


ga tgggect c 


2100 
2103 



tgcaaat cct 


t tgcatgt ac 


gcaaaactag 


60 


cgcgtt tcaa 


caactcat gc 


aacat tacaa 


120 


ca tacaaagc 


aagaaaagga 


caa t aa tact 


180 


ctaat ccaac 


acaaaacaaa 


aaaaagt tgc 


240 


aacct atacg 


taaagtgagt 


ga tgagtcac 


300 


tea tacaaqc 


at tgacatcc 


cataaa tgea 


360 


cct ca tatgt 


atttgeaetc 


at gcat caca 


420 
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<212> DNA 

<213> artificial sequence 

<220> 

<223> synthetic ferulic acid esterase 



< 4 uu^ y y 

a tggccgcct 


ccct cccgac 


ca tgccgccg 


t ccggctacg 


accaggtgcg 


caaeggegtg 


60 


ccgcgcggcc 


aggt gg tgaa 


ca tctcctac 


t tctccaccg 


ccaccaactc 


cacccgcccg 


120 


gcccgcgt g t 


acc t cccqcc 


gggctact cc 


aaggacaaga 


agtact ccgt 


get ctacctc 


180 


ct ccacggca 


t eggegget c 


cgagaacgac 


t ggttcgagg 


gcggcggccg 


cgccaacgt g 


240 


a t cgccgaca 


acct ca tege 


cgagggcaag 


at caagccgc 


teat catcgt 


gaccccgaac 


300 


accaacqccg 


ccggcccggg 


ca t cqccqac 


ggctacgaga 


act t caccaa 


ggacctcctc 


360 


aac t ccct ca 


t cccgtaca t 


cgagtccaac 


tactccgt gt 


acaccgaccg 


cgagcaccgc 


420 


gcca t cgccg 


gect ct ct a t 


gggeggegge 


cagtcct tea 


acat eggect 


caccaacct c 


480 


gacaagt teg 


cctaca t egg 


cccgatct cc 


gccgccccga 


acacctaccc 


gaacgagege 


540 


ct ct tecegg 


aeggeggcaa 


ggccgcccgc 


gagaagct ca 


agctcct ct t 


catcgcctgc 


600 


ggca ccaacg 


act ccctca t 


egget tegge 


cagcgcgtgc 


acgagt actg 


cgtggccaac 


660 


aaca t caacc 


aegt gt act g 


get ca tccag 


ggcggcggcc 


acgact tcaa 


cgtgtggaag 


720 


ccgggcct ct 


ggaac 1 1 cct 


ccaqa tggee 


gaegaggecg 


gect cacccg 


cgacggcaac 


780 


accccggtgc 


cgaccccgt c 


cccgaagccg 


gccaacaccc 


gcatcgaggc 


cgaggactac 


840 


gaeggcat ca 


act cctcctc 


catcgaga tc 


a tcggcgtgc 


cgccggaggg 


cggccgcggc 


900 


a tegget aca 


t cacctccgg 


cgact acctc 


g tgtacaagt 


ccatcgact t 


cggcaacggc 


960 


gccacct cc t 


teaaggecaa 


ggt ggecaac 


gccaacacct 


ccaaca tcga 


gcttcgcctc 


1020 


aacggcccga 


acggcaccct 


ca t cggcacc 


ctctccgtga 


agtccaccgg 


cgact ggaac 


1080 


acct acgagg 


agcagacc tg 


ct ccatct cc 


aaggtgaccg 


gcat caacqa 


cctctacct c 


1140 


gtg t tcaagg 


geceggt gaa 


ca tcgact gg 


t tcacctt eg 


gcgtgtag 




1188 



<210> 100 

<211> 395 

<212> PRT 

<213> artificial sequence 
<220> 

<223> synthetic ferulic acid esterase 

<400> 100 
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Met Ala Ala Ser 
1 



Arg Asn Gly Val 

20 



Thr Ala Thr Asn 
35 



Tyr Ser Lys Asp 
50 



Gly Gly Ser Glu 

65 



He Ala Asp Asn 



Val Thr Pro Asn 

100 



Glu Asn Phe Thr 
115 



Ser Asn Tyr Ser 
130 



Leu Ser Met Gly 
145 



Asp Lys Phe Ala 



Pro Asn Glu Arg 

180 



Leu Lys Leu Leu 
195 



Phe Gly Gin Arg 
210 



Val Tyr Trp Leu 
22 5 



Leu Pro Thr Met 
5 



Pro Arg Gly Gin 



Ser Thr Arg Pro 

40 



Lys Lys Tyr Ser 
55 



Asn Asp Trp Phe 

70 



Leu He Ala Glu 
85 



Thr Asn Ala Ala 



Lys Asp Leu Leu 

120 



Val Tyr Thr Asp 
135 



Gly Gly Gin Ser 
150 



Tyr He Gly Pro 
165 



Leu Phe Pro Asp 



Phe He Ala Cys 

200 



Val His Glu Tyr 
215 



He Gin Gly Gly 

230 



Pro Pro Ser Gly 
10 



Val Val Asn He 
25 



Ala Arg Val Tyr 



Val Leu Tyr Leu 

60 



Glu Gly Gly Gly 
75 



Gly Lys He Lys 
90 



Gly Pro Gly He 
105 



Asn Ser Leu He 



Arg Glu His Arg 

140 



Phe Asn He Gly 
155 



He Ser Ala Ala 
170 



Gly Gly Lys Ala 
185 



Gly Thr Asn Asp 



Cys Val Ala Asn 

22 0 



Gly His Asp Phe 
235 



Tyr Asp Gin Val 
1-5 



Ser Tyr Phe Ser 
30 



Leu Pro Pro Gly 
45 



Leu His Gly lie 



Arg Ala Asn Val 

80 



Pro Leu He He 
95 



Ala Asp Gly Tyr 
110 



Pro Tyr He Glu 
125 



Ala He Ala Gly 



Leu Thr Asn Leu 

160 



Pro Asn Thr Tyr 
175 



Ala Arg Glu Lys 
190 



Ser Leu He Gly 
205 



Asn lie Asn His 



Asn Val Trp Lys 

240 
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Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr 

245 250 255 

Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn 

260 265 270 

Thr Arg He Glu Ala Glu Asp Tyr Asp Gly He Asn Ser Ser Ser He 
275 280 285 

Glu He He Gly Val Pro Pro Glu Gly Gly Arg Gly He Gly Tyr He 
290 295 300 

Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe Gly Asn Gly 
305 310 315 320 

Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn He 

325 330 335 

Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly Thr Leu Ser 

340 345 350 

Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser 
355 360 365 

He Ser Lys Val Thr Gly He Asn Asp Leu Tyr Leu Val Phe Lys Gly 
370 375 380 

Pro Val Asn He Asp Trp Phe Thr Phe Gly Val 
385 390 395 



<210> 101 
<211> 1188 
<212> DNA 

<213> artificial sequence 
<220> 

<223> plasmid 13036 
<400> 101 

atggccgcct ccctcccgac catgccgccg tccggctacg accaggtgcg caacggcgtg 
ccgcgcggcc aggtggtgaa catctcctac ttctccaccg ccaccaactc cacccgcccg 
gcccgcgtgt acctcccgcc gggctactcc aaggacaaga agtactccgt gctctacctc 



60 
120 
180 



ctccacggca tcggcggctc cgagaacgac tggttcgagg gcggcggccg cgccaacgtg 240 
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a t cgccgaca 


acctcat cgc 


cgagggcaag 


atcaagccgc 


tcatca tegt 


gaccccgaac 


300 


accaacqccg 


ccqgcccggg 


ca t cgccgac 


ggctacgaga 


act tcaccaa 


ggacctcctc 


360 


aactccctca 


tcccqtacat 


cgagt ccaac 


t actccgtgt 


acaccgaccg 


cgagcaccgc 


420 


gcca t cgccg 


gcctctcta t 


gggeggegge 


cagtcct tea 


acat eggect 


caccaacct c 


480 


gacaagt t eg 


cctaca tegg 


cccgat ctcc 


gccgccccga 


acacctaccc 


gaacgagege 


54 0 


ctct tcccqq 


acqgcqgcaa 


ggccgcccgc 


gagaagct ca 


agctcct ct t 


catcgcctgc 


600 


aacaccaaca 


act ccct cat 


egget tegge 


cagcgcgtgc 


acgagt actg 


cgt ggecaac 


660 


aacatcaacc 


acqtg tactg 


qct cat ccag 


ggcggcggcc 


acgact t caa 


cgt gt ggaag 


720 


ccq qq cc t ct 


qqaact t cct 


ccagat ggee 


gaegaggecg 


gect cacccg 


cgacggcaac 


780 


accccqqt gc 


cgaccccgtc 


cccgaagccg 


gccaacaccc 


gca tcgaggc 


cgaggact a c 


840 


gaeggea t ca 


actcctcctc 


catcgaga tc 


ateggegt gc 


cgccggaggg 


cggccgcggc 


900 


ateggctaca 


+- /—* **i /™* / — « 4— /~» r~\ (~\ 

tcaccLCtyy 




nt ntacaaat 


ccatcaac 1 1 


cqqcaacqqc 


960 


gccacct cc t 


teaaggecaa 


ggtggccaac 


gccaacacct 


ccaaca tcga 


get tcgcctc 


1020 


aacggcccga 


acggcaccct 


ca t cggcacc 


ct ct ccgtga 


agt ccaccgg 


cgactggaac 


1080 


acct acgagg 


agcagacctg 


ct ccatctcc 


aaggt gaccg 


gca tcaacga 


cctctacct c 


1140 


gtgtt caagg 


gcccggtgaa 


ca tcgactgg 


t tcacct teg 


gcgtg tag 




1188 



<210> 102 

<211> 395 

<212> PRT 

<213> artificial sequence 

<220> 

<223> plasmid 13036 

<400> 102 



Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val 
15 10 15 

Arq Asn Gly Val Pro Arg Gly Gin Val Val Asn He Ser Tyr Phe Ser 

20 25 30 

Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu Pro Pro Gly 
35 40 45 

Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu His Gly He 
50 ' 55 60 
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Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg Ala Asn Val 
65 70 75 80 

lie Ala Asp Asn Leu lie Ala Glu Gly Lys lie Lys Pro Leu He He 

85 90 95 

Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala Asp Gly Tyr 

100 105 HO 

Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro Tyr He Glu 
115 120 125 

Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala He Ala Gly 
130 135 140 

Leu Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu Thr Asn Leu 
145 150 155 160 

Asp Lys Phe Ala Tyr lie Gly Pro lie Ser Ala Ala Pro Asn Thr Tyr 

165 170 175 

Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala Arg Glu Lys 

180 185 190 

Leu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser Leu He Gly 
195 200 205 

Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn lie Asn His 
210 215 220 

Val Tyr Trp Leu lie Gin Gly Gly Gly His Asp Phe Asn Val Trp Lys 
225 230 235 240 

Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala Gly Leu Thr 

245 250 255 

Arq Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys Pro Ala Asn 

260 265 270 

Thr Arg lie Glu Ala Glu Asp Tyr Asp Gly lie Asn Ser Ser Ser lie 
275 280 285 
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Glu lie He Gly Val Pro Pro Glu Gly Gly Arg Gly He Gly Tyr He 
290 295 300 

Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe Gly Asn Gly 
305 L 310 315 320 



Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn He 

325 330 335 



Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu lie Gly Thr Leu Ser 

340 345 350 

Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser 

355 360 365 

He Ser Lys Val Thr Gly lie Asn Asp Leu Tyr Leu Val Phe Lys Gly 

370 375 380 



Pro Val Asn lie Asp Trp Phe Thr Phe Gly Val 
385 390 395 



<210> 103 

<211> 1245 

<212> DNA 

<213> artificial sequence 
<220> 

<223> plasmid 13038 



<400> 103 
atgagggtgt 


tgctcgt tgc 


cctcgct etc 


ctggctc t eg 


ctgcgagcgc 


cacct ccatg 


60 


gccgcc t ccc 


t cccqacca t 


gccgccgtcc 


ggctacgacc 


aggtgcgcaa 


cggcgtgccg 


120 


cgcggccagg 


t gg tgaaca t 


ct cct act tc 


tccaccqcca 


ccaactccac 


ccgcccggcc 


180 


cgcg tgt acc 


tcccgccggg 


ctactccaag 


gacaagaagt 


actccgtgct 


ctacct cctc 


240 


cacggca teg 


gcggctccga 


gaacgactgg 


1 1 egagggeg 


gcggccgcgc 


caacgtga tc 


300 


gccgacaacc 


t catcgccga 


gggcaagatc 


aagccgctca 


tcatcg tgac 


cccgaacacc 


360 


aacgccgccg 


gcccgggca t 


cgccgacggc 


tacgagaact 


tcaccaagga 


cctcctcaac 


420 


tccctca tec 


cgtaca t cga 


gt ccaactac 


tccgtgtaca 


ccgaccgcga 


gcaccqcgcc 


480 


a t cgccggcc 


tctct atggg 


cggcggccag 


t ccttcaaca 


tcggcctcac 


caacct cgac 


540 


aag t tegect 


aca tcggccc 


ga tctccgcc 


gccccgaaca 


cctacccgaa 


cgagcgcct c 


600 


1 1 cccggacg 


geggcaagge 


cgcccgcgag 


aagct caagc 


t cctct teat 


cgect gegge 


660 
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a ccaacaa c t 


ccct ca t egg 


ct teggecag 


cgcgt gcacg 


agtactgcgt 


ggccaacaac 


720 


atcaa cca ca 


t gtact ggct 


catccagggc 


ggcggccacg 


act teaaegt 


gtggaagccg 


780 


qqcctctqqa 


act t cct cca 


qat ggecgae 


gaggccggcc 


tcacccgcga 


cggcaacacc 


840 


ccqq t qccqa 


ccccgtcccc 


qaagccggcc 


aacacccgca 


tegaggcega 


ggact acgac 


900 


ggca t caact 


cctcct cca t 


cgagat ca tc 


ggcgtgccgc 


eggagggegg 


ccgcggca tc 


960 


ggc taca t ca 


cctccggcga 


c t a cc t eg t g 






\ • O O w VJ \ — \*j \— U 


1020 

-L. w i— V-/ 


acctcct tea 


aggecaaggt 


ggccaacgcc 


aacacctcca 


acat cgagct 


tcgcctcaac 


1080 


ggcccgaa eg 


gcaccct ca t 


eggea ccctc 


t ccgtgaagt 


ccaccggcga 


ctggaacacc 


1140 


t acgaggagc 


agacct gctc 


catct ccaag 


gtgaceggea 


tcaacgacct 


ctacct cgtg 


1200 


t t caagggee 


eggt gaacat 


cgactggt tc 


acct teggeg 


t gtag 




124 5 



<210> 104 
<211> 414 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 13038 aa 
<400> 104 

Met Arg Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 

Ala Thr Ser Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr 

20 25 30 

Asp Gin Val Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn He Ser 
35 4 0 4 5 

Tyr Phe Ser Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu 
50 55 60 

Pro Pro Gly Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu 

65 70 75 80 

His Gly He Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg 

85 90 95 

Ala Asn Val He Ala Asp Asn Leu He Ala Glu Gly Lys lie Lys Pro 

100 105 HO 
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Leu lie He Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala 
115 120 125 

Asp Gly Tyr Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro 
130 135 140 

Tyr He Glu Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala 
145 150 155 160 

He Ala Gly Leu Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu 

165 170 175 

Thr Asn Leu Asp Lys Phe Ala Tyr He Gly Pro He Ser Ala Ala Pro 

180 185 190 

Asn Thr Tyr Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala 
195 200 205 

Arq Glu Lys Leu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser 
210 215 220 

Leu He Gly Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn 
225 230 235 240 

He Asn His Val Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn 

245 250 255 

Val Trp Lys Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala 

260 265 270 

Gly Leu Thr Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys 
275 280 285 

Pro Ala Asn Thr Arg He Glu Ala Glu Asp Tyr Asp Gly He Asn Ser 
290 295 300 

Ser Ser He Glu He lie Gly Val Pro Pro Glu Gly Gly Arg Gly He 
305 310 315 320 

Glv Tyr He Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe 

32 5 330 335 
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Gly Asn Gly Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr 

340 345 350 

Ser Asn He Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly 
355 360 365 

Thr Leu Ser Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin 

370 375 380 

Thr Cys Ser lie Ser Lys Val Thr Gly lie Asn Asp Leu Tyr Leu Val 
385 390 395 400 

Phe Lys Gly Pro Val Asn He Asp Trp Phe Thr Phe Gly Val 

4 05 4 10 



<210> 105 
<211> 1425 
<212> DNA 

<213> artificial sequence 










<220> 

<223> plasmid 13039 












<400> 105 
atgctggcgg 


ct ct ggecac 


gtcgcagctc 


gtcgcaacgc 


gcgccggcct 


gggcg t cccg 


60 


gacgcgt cca 


cgt tccgccg 


cggcgccgcg 


cagggectga 


ggggggcccg 


ggcgtcggcg 


120 


gcggcggaca 


cgctcagcat 


gcggaccagc 


gcgcgcgcgg 


cgcccaggca 


ccagcaccag 


180 


caggcgcgcc 


gcggggccag 


gt tcccgtcg 


ct cgtcgtgt 


gcgccagcgc 


cggcgcca tg 


240 


gccgcct ccc 


t cccgaccat 


gccgccgt cc 


ggct acgacc 


aggtgcgcaa 


cggcgtgccg 


300 


cgcggccagg 


tggtgaaca t 


ct cctact tc 


tccaccgcca 


ccaactccac 


ccgcccggcc 


360 


cgcgtg tacc 


t cccgccggg 


ctact ccaaq 


gacaagaagt 


actccgt get 


ctacct cctc 


420 


cacggca teg 


gegget ccga 


gaacgactgg 


t tegagggeg 


gcggccgcgc 


caaegtgat c 


480 


gccgacaacc 


tcatcgccga 


gggcaaga t c 


aagccgctca 


t catcgt gac 


cccgaacacc 


540 


aacgccgccg 


gcccgggcat 


cgccgacggc 


tacgagaact 


tcaccaagga 


cct cctcaac 


600 


t ccct ca tec 


cgtaca tcga 


gtccaact ac 


teegt gtaca 


ccgaccgcga 


gcaccgcgcc 


660 


a t cgccggcc 


t ct ct atggg 


cggcggccag 


tcct t caaca 


teggect cac 


caacct cgac 


720 


aagt t cgect 


aca t cggccc 


ga tctccgcc 


gccccgaaca 


cctacccgaa 


cgagcgcctc 


780 


1 1 cccggacg 


geggcaagge 


cgcccgcgag 


aagct caagc 


tcctcttcat 


cgcctgcggc 


840 


accaacgact 


ccct ca tegg 


ct teggecag 


cgcgt gcacg 


ag tactgegt 


ggccaacaac 


900 
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at caaccacg 


t gt ac tggct 


ca t ccagggc 


ggcggccacg 


act tcaacgt 


gtggaagccg 


960 


ggcctctgga 


act t cctcca 


ga tggccgac 


gaggccggcc 


tcacccgcga 


cggcaacacc 


1020 


ccggt gccga 


ccccg t cccc 


gaagccggcc 


aacacccgca 


tcgaggccga 


ggact acgac 


1080 


ggcat caact 


cctcctccat 


cgaga tcatc 


ggcgt gccgc 


cggagggcgg 


ccgcggcatc 


1140 


nnr f ^ r3 t" PA 

y y U LdLa L LO 


rrtccaacaa 


ct acct cqt q 


tacaagt cca 


tcgact t egg 


caacggcgcc 


1200 


acctccttca 


aggccaaggt 


ggccaacgcc 


aacacctcca 


acat cgagct 


tcgcctcaac 


1260 


ggcccgaacg 


gcaccct ca t 


cggcaccct c 


tccgtgaagt 


ccaccggcga 


ctggaacacc 


1320 


t acgaggagc 


agacctgct c 


ca t ct ccaag 


gtgaccggca 


t caacgacct 


ctacctcg tg 


1380 


t t caagggcc 


cggt gaaca t 


cgactggt tc 


acct tcggcg 


t gt ag 




1425 



<210> 106 
<211> 474 
<212> PRT 

<213> artificial sequence 
<22C> 

<223> plasnud 13039 aa 

<400> 106 

Met Leu Ala Ala Leu Ala Thr Ser Gin Leu Val Ala Thr Arg Ala Gly 
15 10 15 

Leu Gly Val Pro Asp Ala Ser Thr Phe Arg Arg Gly Ala Ala Gin Gly 

20 25 30 

Leu Arg Gly Ala Arg Ala Ser Ala Ala Ala Asp Thr Leu Ser Met Arg 
35 40 45 

T-hr Ser Ala Arg Ala Ala Pro Arg His Gin His Gin Gin Ala Arg Arg 

50 ^ 55 60 



ly Ala Arg Phe Pro Ser Leu Val Val Cys Ala Ser Ala Gly Ala Met 
65 70 75 80 

Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr Asp Gin Val Arg 

85 90 95 

Asn Gly Val Pro Arg Gly Gin Val Val Asn lie Ser Tyr Phe Ser Thr 

100 ' 105 HO 
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Ala Thr Asn Ser 
115 



Ser Lys Asp Lys 
130 



Gly Ser Glu Asn 
14 5 



Ala Asp Asn Leu 



Thr Pro Asn Thr 

180 



Asn Phe Thr Lys 
195 



Asn Tyr Ser Val 
210 



Ser Met Gly Gly 

225 



Lys Phe Ala Tyr 



Asn Glu Arg Leu 

260 



Lys Leu Leu Phe 
275 



Gly Gin Arg Val 
2 90 



Tyr Trp Leu lie 

305 



Gly Leu Trp Asn 



Asp Gly Asn Thr 

34 0 



Thr Arg Pro Ala 

120 



Lys Tyr Ser Val 
135 



Asp Trp Phe Glu 
150 



He Ala Glu Gly 
165 



Asn Ala Ala Gly 



Asp Leu Leu Asn 

200 



Tyr Thr Asp Arg 
215 



Gly Gin Ser Phe 
2 30 



He Gly Pro He 
245 



Phe Pro Asp Gly 



He Ala Cys Gly 

280 



His Glu Tyr Cys 
295 



Gin Gly Gly Gly 
310 



Phe Leu Gin Met 

325 



Pro Val Pro Thr 



Arg Val Tyr Leu 



Leu Tyr Leu Leu 

140 



Gly Gly Gly Arg 
155 



Lys He Lys Pro 
170 



Pro Gly He Ala 
18 5 



Ser Leu He Pro 



Glu His Arg Ala 

220 



Asn He Gly Leu 
235 



Ser Ala Ala Pro 
250 



Gly Lys Ala Ala 

2 65 



Thr Asn Asp Ser 



Val Ala Asn Asn 

300 



His Asp Phe Asn 
315 



Ala Asp Glu Ala 
330 



Pro Ser Pro Lys 
345 
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Pro Pro-Gly Tyr 
125 



His Gly He Gly 



Ala Asn Val He 

160 



Leu He He Val 
175 



Asp Gly Tyr Glu 
190 



Tyr He Glu Ser 
205 



He Ala Gly Leu 



Thr Asn Leu Asp 

240 



Asn Thr Tyr Pro 
255 



Arg Glu Lys Leu 

270 



Leu He Gly Phe 
285 



He Asn His Val 



Val Trp Lys Pro 

320 



Gly Leu Thr Arg 
335 



Pro Ala Asn Thr 
350 
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Arg lie Glu Ala Glu Asp Tyr Asp Gly lie Asn Ser Ser Ser lie Glu 
355 360 365 

lie He Gly Val Pro Pro Glu Gly Gly Arg Gly He Gly Tyr lie Thr 
370 375 380 

Ser Glv Asp Tyr Leu Val Tyr Lys Ser He Asp Phe Gly Asn Gly Ala 
385 390 395 400 

Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr Ser Asn He Glu 

405 410 415 

Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly Thr Leu Ser Val 

420 425 430 

Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin Thr Cys Ser He 
4 35 ~ 440 445 

Ser Lys Val Thr Gly He Asn Asp Leu Tyr Leu Val Phe Lys Gly Pro 
450 455 460 

Val Asn He Asp Trp Phe Thr Phe Gly Val 
465 470 



<210> 107 

<211> 1263 

<212> DNA 

<213> artificial sequence 
<220> 

<223> plasmid 13347 



a tgagggt gt 


tgctcgt tgc 


cctcgctctc 


ctggct ctcg 


ctgcgagcgc 


cacct ccatg 


60 


gccgcc t ccc 


t cccgacca t 


gccgccgtcc 


ggct acgacc 


aggtgcgcaa 


cggcgtgccg 


120 


cgcggccagg 


tggtgaaca t 


ct cctact t c 


tccaccgcca 


ccaact ccac 


ccgcccggcc 


180 


cgcg tgt acc 


tcccgccggg 


ctactccaag 


gacaagaagt 


actccgtgct 


ctacct cctc 


240 


cacggca t eg 


gegget ccga 


gaacgactgg 


ttcgagggcg 


gcggccgcgc 


caacgtga tc 


300 


gccgacaacc 


tcatcgccga 


gggcaagatc 


aagccgctca 


teategtgae 


cccgaacacc 


360 


aacgccgccg 


gcccgggca t 


cgccgacggc 


tacgagaact 


t caccaagga 


cct cctcaac 


420 


t ccct ca t cc 


cgt acat cga 


gt ccaactac 


t ccgtgtaca 


ccgaccgcga 


gcaccgcgcc 


480 
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a t caccacicc 


t ct ctatggg 


cgqcggccag 


t cct t caaca 


tcggcctcac 


caacctcgac 


540 


aaattcacct 


acat cggccc 


ga t ctccgcc 


gccccgaaca 


cctacccgaa 


cgagcgcct c 


600 


1 1 cccggacg 


qegqeaagqe 


cgcccgcgag 


aagc t caagc 


t cctcttca t 


cgect gegge 


660 


a ccaa cga c t 


ccct categg 


ct teggecag 


cgcgt gcacg 


agt actgegt 


ggccaacaac 


720 


a t ca a ccaco 


tgt actggct 


ca tccagggc 


ggcggccacg 


act teaaegt 


gtggaagccg 


780 


aacctct ooa 


act t cct cca 


qa tggecgae 


gaggccggcc 


t cacccgcga 


cggcaacacc 


840 


ccaataccoa 


ccccgt cccc 


gaagccggcc 


aacacccgca 


t egaggcega 


ggact acgac 


900 


aacatcaact 


cctcct cca t 


cgaga teat c 


ggcg tgeege 


eggagggegg 


ccgcggca t c 


960 




cctccaocoa 


ct acctegtg 


tacaagt cca 


tcgact tegg 


caacggcgcc 


1020 


acctcct tea 


aggecaaggu 




A^r^rrt PCS 

O C* W t-* w v_> \_ cj 


acatcaaact 


t cgcctcaac 


1080 


ggcccgaacg 


gcaccct ca t 


cggcaccctc 


tccgtgaagt 


ccaccggcga 


ctggaacacc 


1140 


t acgaggagc 


agacc tgetc 


ca tct ccaag 


gtgaceggea 


t caacgacct 


ctacct cgtg 


1200 


c t caagggee 


egg tgaacat 


cgact ggt tc 


acct teggeg 


tgt ccgagaa 


ggacgaact c 


1260 


tag 












1263 



<210> 108 
<211> 420 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 13347 
<400> 108 

Met Arq Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 

Ala Thr Ser Met Ala Ala Ser Leu Pro Thr Met Pro Pro Ser Gly Tyr 

20 25 30 

Asp Gin Val Arg Asn Gly Val Pro Arg Gly Gin Val Val Asn He Ser 
35 40 45 

Tyr Phe Ser Thr Ala Thr Asn Ser Thr Arg Pro Ala Arg Val Tyr Leu 
50 55 60 

Pro Pro Glv Tyr Ser Lys Asp Lys Lys Tyr Ser Val Leu Tyr Leu Leu 
65 70 75 80 
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His Gly He Gly Gly Ser Glu Asn Asp Trp Phe Glu Gly Gly Gly Arg 

85 90 95 

Ala Asn Val He Ala Asp Asn Leu He Ala Glu Gly Lys He Lys Pro 

100 105 HO 

Leu He He Val Thr Pro Asn Thr Asn Ala Ala Gly Pro Gly He Ala 
115 120 125 

Asp Gly Tyr Glu Asn Phe Thr Lys Asp Leu Leu Asn Ser Leu He Pro 
130 135 1^0 

Tyr He Glu Ser Asn Tyr Ser Val Tyr Thr Asp Arg Glu His Arg Ala 
145 150 155 160 

He Ala Gly Leu Ser Met Gly Gly Gly Gin Ser Phe Asn He Gly Leu 

165 170 175 

Thr Asn Leu Asp Lys Phe Ala Tyr He Gly Pro He Ser Ala Ala Pro 

180 185 190 

Asn Thr Tyr Pro Asn Glu Arg Leu Phe Pro Asp Gly Gly Lys Ala Ala 
195 200 205 

Arg Glu Lys Leu Lys Leu Leu Phe He Ala Cys Gly Thr Asn Asp Ser 
210 215 220 

Leu He Gly Phe Gly Gin Arg Val His Glu Tyr Cys Val Ala Asn Asn 
225 230 235 240 

He Asn His Val Tyr Trp Leu He Gin Gly Gly Gly His Asp Phe Asn 

245 250 255 

Val Trp Lys Pro Gly Leu Trp Asn Phe Leu Gin Met Ala Asp Glu Ala 

260 265 270 

Gly Leu Thr Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys 
275 280 285 

Pro Ala Asn Thr Arg He Glu Ala Glu Asp Tyr Asp Gly He Asn Ser 
290 " 295 300 
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Ser Ser lie Glu He He Gly Val Pro Pro Glu Gly Gly Arg Gly He 
305 310 315 320 

Gly Tyr lie Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser He Asp Phe 

325 330 335 

Glv Asn Gly Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr 

340 345 350 

Ser Asn He Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu He Gly 
355 360 365 

Thr Lpu Ser Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gin 

370 375 380 

Thr Cvs Ser He Ser Lys Val Thr Gly He Asn Asp Leu Tyr Leu Val 
385 390 395 400 

Phe Lys Gly Pro Val Asn He Asp Trp Phe Thr Phe Gly Val Ser Glu 

40b 410 415 



Lys Asp Glu Leu 

4 20 



<210> 
<211> 
<212> 
<213> 


109 

1296 

DNA 

artificial sequence 










<220> 
<223> 


plasmid 11267 












<400> 109 
a tgagggtgt 


tgctcgttgc 


cctcgctctc 


ctggctctcg 


ctgcgagcgc 


caccagcgct 


60 


gcgcagtccg 


agccggagct 


gaagctggag 


tccgtggtga 


tcgtgtcccg 


ccacggcgtg 


120 


cgcgccccga 


ccaaggccac 


ccagct ca t g 


caggacg tga 


ccccggacgc 


ct ggccgacc 


180 


t ggccggtga 


agct cggcga 


get ga ccccg 


cgcggcggcg 


agetgatege 


ctacct egge 


240 


cactactggc 


gccagcgcct 


cgtggccgac 


ggcctcct cc 


egaagtgegg 


ct gcccgcag 


300 


t ccggccagg 


t ggcca teat 


cgccgacgt g 


gaegagegea 


cccgcaagac 


cggcgaggcc 


360 


t tcgccgccg 


gect cgcccc 


ggactgcgcc 


atcaccgtgc 


acacccaggc 


cgacacctcc 


420 


t ccccggacc 


egctet tcaa 


cccgct caag 


accggcgtgt 


gccagctcga 


caacgccaac 


480 


gtgaccgacg 


ccatcctgga 


gcgcgccggc 


ggctccatcg 


ccgact t cac 


cggccact ac 


540 
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OSlQBCCGCCtL 


t ccqcqaqct 


ggagcgcgtg 


ctcaacttcc 


cgcagt ccaa 


cctctgcctc 


600 


>—j »^ ■ 


aqcaqqacqa 


gt cctgctcc 


ctcacccagg 


ccctcccgt c 


cgagctgaag 


660 


atatccacco 


act gcgtgtc 


cct caccggc 


gccgt gtccc 


t cgcct ccat 


gctcaccgaa 


720 


afrtt cc t cc 


t ccagcaggc 


ccagggcatg 


ccggagccgg 


gctggggccg 


catcaccgac 


780 


t cccacca at 


qqaacaccct 


cctctccctc 


cacaacgccc 


agt tcgacct 


cctccagcgc 


84 0 




tqqcccqctc 


ccqcgccacc 


ccgct cctcg 


acctca t caa 


gaccgccct c 


900 




cqccqcaqaa 


qcaggcctac 


ggcgtgaccc 


t cccgacctc 


cgtgctct t c 


960 


a tcaccqqcc 


acgacaccaa 


cctcgccaac 


ctcggcggcg 


ccctggagct 


gaactggacc 


1020 


ctcccgggcc 


-\ /"^ / — ' ^ At /*! ^ 3 3 

ayccgcjatda 




qqcqqcqaqc 


tqqtgt tcga 


gcgct ggcgc 


1080 


cgcct ctccg 


acaactccca 


gt ggat t cag 


gtgtccctcg 


tgt tccagac 


cct ccagcag 


1140 


a t gcgcgaca 


agaccccgct 


ctccctcaac 


accccgccgg 


gcgaggtgaa 


gctcaccct c 


1200 


gccggctgcg 


aggagcgcaa 


cgcccagggc 


a tgtgctccc 


t cgccggctt 


cacccagat c 


1260 


gtgaacgagg 


cccgca tccc 


ggcctgctcc 


ct ctaa 






1296 



<210> 110 
<211> 431 
<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 11267 aa sequence 
<400> 110 

Met Arq Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
15 10 15 

Ala Thr Ser Ala Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser Val 

20 25 30 

Val He Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr Gin 
35 40 45 

Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val Lys 
50 55 60 

Leu Gly Glu Leu Thr Pro Arg Gly Gly Glu Leu He Ala Tyr Leu Gly 
65 70 75 80 
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His Tyr Trp Arg 



Gly Cys Pro Gin 

100 



Arg Thr Arg Lys 
115 



Cys Ala lie Thr 
130 



Leu Phe Asn Pro 
145 



Val Thr Asp Ala 



Thr Gly His Tyr 

180 



Phe Pro Gin Ser 
195 



Cys Ser Leu Thr 
210 



Cys Val Ser Leu 

225 



lie Phe Leu Leu 



Arg lie Thr Asp 

260 



Ala Gin Phe Asp 

275 



Ala Thr Pro Leu 
290 



Pro Gin Lys Gin 
305 



Gin Arg Leu Val 
85 



Ser Gly Gin Val 



Thr Gly Glu Ala 

120 



Val His Thr Gin 
135 



Leu Lys Thr Gly 
150 



He Leu Glu Arg 
165 



Gin Thr Ala Phe 



Asn Leu Cys Leu 

200 



Gin Ala Leu Pro 
215 



Thr Gly Ala Val 

230 



Gin Gin Ala Gin 

245 



Ser His Gin Trp 



Leu Leu Gin Arg 

280 



Leu Asp Leu He 
295 



Ala Tyr Gly Val 
310 



Ala Asp Gly Leu 
90 



Ala He He Ala 
105 



Phe Ala Ala Gly 



Ala Asp Thr Ser 

140 



Val Cys Gin Leu 
155 



Ala Gly Gly Ser 
170 



Arg Glu Leu Glu 
185 



Lys Arg Glu Lys 



Ser Glu Leu Lys 

220 



Ser Leu Ala Ser 
235 



Gly Met Pro Glu 
250 



Asn Thr Leu Leu 

265 



Thr Pro Glu Val 



Lys Thr Ala Leu 

300 



Thr Leu Pro Thr 
315 
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Leu Pro Lys Cys 
95 



Asp Val Asp Glu 
110 



Leu Ala Pro Asp 
125 



Ser Pro Asp Pro 



Asp Asn Ala Asn 

160 



He Ala Asp Phe 

175 



Arg Val Leu Asn 
190 



Gin Asp Glu Ser 
205 



Val Ser Ala Asp 



Met Leu Thr Glu 

240 



Pro Gly Trp Gly 
255 



Ser Leu His Asn 

270 



Ala Arg Ser Arg 
285 



Thr Pro His Pro 



Ser Val Leu Phe 

320 
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lie Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu Glu 

325 330 335 

Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly Gly 

340 345 350 

Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin Trp 
355 360 365 

Tie Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp Lys 
370 375 330 

Th- Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr Leu 
385 390 395 400 

Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala Gly 

405 4 10 415 

Phe Thr Gin He Val Asn Glu Ala Arg He Pro Ala Cys Ser Leu 

420 425 430 



<210> 
<21 1> 
<212> 
<21 3> 


111 

1314 

DNA 

artificial sequence 










<220> 
<223> 


plasmid 11268 












<400> 111 
atgagggtgt 


tgct cgttgc 


cct cgctctc 


ctggctct eg 


c tgegagege 


caccagcgct 


60 


gcgcag tccg 


agccggagct 


gaagctggag 


teegt ggtga 


tcgtgtcccg 


ccacggcgtg 


120 


cgcgccccga 


ccaaggccac 


ccagct catg 


caggacgtga 


ccccggacgc 


ctggccgacc 


180 


tggccggt ga 


agct cggcga 


gctgaccccg 


cgcggcggcg 


agctgat cgc 


ctacctcggc 


240 


cact actggc 


gccagcgcct 


cgt ggccgac 


ggect cct cc 


egaagtgegg 


ctgcccgcag 


300 


t ccggccagg 


tggccatcat 


cgccgacgtg 


gaegagegea 


cccgcaagac 


cggcgaggcc 


360 


t tcgccgccg 


gcct cgcccc 


ggactgcgcc 


atcaccgtgc 


acacccaggc 


cgacacctcc 


420 


t ccccggacc 


cgct ct tcaa 


cccgctcaag 


accggcgtgt 


gccagctcga 


caacgccaac 


480 


gtgaccgacg 


cca tcctgga 


gcgcgccggc 


ggctcca teg 


ccgact t cac 


cggccactac 


540 


caqacc 


gcct 


t ccgcgagct 


ggagcgcgtg 


ctcaact t cc 


cgcag t ccaa 


cctctgcctc 


600 
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aagcgcgaga 


agcaggacga 


gtcctgctcc 


ctcacccagg 


ccctcccgtc 


cgagctgaag 


660 


gzg t ccgccg 


actgcgtgtc 


cctcaccggc 


gccgtg t ccc 


tcgcct ccat 


get caccgaa 


720 


a t ct t cct cc 


t ccagcaggc 


ccagggcatg 


ccggagccgg 


gctggggccg 


cat caccgac 


780 


t cccaccagt 


ggaacaccc t 


cctct ccct c 


cacaacgccc 


agtt cgacct 


cct ccaqcqc 


840 


accccggagg 


t ggcccgct c 


ccgcgccacc 


ccgct cctcg 


acct catcaa 


gaccgccct c 


900 


accccqcacc 


cgccgcagaa 


gcaggcctac 


ggcgt gaccc 


tcccgacctc 


cgtgctct tc 


960 


a t cgccggcc 


acgacaccaa 


cct cgccaac 


ctcggcggcg 


ccctggagct 


gaactggacc 


1020 


ct cccgggcc 


agccggacaa 


caccccgccg 


ggcggcgagc 


tggtgt tcga 


gcgctggcgc 


1080 


cgcct ctccg 


acaact ccca 


gtggat tcag 


g t gt ccctcg 


tgttccagac 


cct ccagcag 


1140 


a tgcgcgaca 


agaccccgct 


ctccct caac 


accccgccgg 


gcgaggt gaa 


get caccctc 


1200 


gccggc t gcg 


aggagcgcaa 


cgcccagggc 


at gtgct ccc 


t cgccggctt 


cacccaga t c 


1260 


gtgaacgagg 


cccgca tccc 


ggcctgctcc 


ctctccgaga 


aggacgagct 


gtaa 


1314 



<210> 112 

<211> 437 

<212> PRT 

<213> artificial sequence 
<220> 

<223> plasmid 11268 amino acid sequence 

<400> 112 

Met Arq Val Leu Leu Val Ala Leu Ala Leu Leu Ala Leu Ala Ala Ser 
1 5 10 15 

Ala Thr Ser Ala Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser Val 

20 25 30 

Val He Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr Gin 
35 ^O 45 

Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val Lys 
50 55 60 

Leu Gly Glu Leu Thr Pro Arg Gly Gly Glu Leu He Ala Tyr Leu Gly 
65 70 75 80 

His Tyr Trp Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Pro Lys Cys 

8 5 90 95 
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Gly Cys Pro Gin Ser Gly Gin Val Ala lie lie Ala Asp Val Asp Glu 

100 105 HO 

Arg Thr Arg Lys Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro Asp 
115 120 125 

Cys Ala lie Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp Pro 
130 135 140 

Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala Asn 
145 150 155 160 

Val Thr Asp Ala lie Leu Glu Arg Ala Gly Gly Ser lie Ala Asp Phe 

,165 170 175 

Thr Glv His ^yr Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu Asn 

180 185 190 

Phe Pro Gin Ser Asn Leu Cys Leu Lys Arg Glu Lys Gin Asp Glu Ser 
195 200 205 

Cvs Ser Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala Asp 
210 215 220 

Cvs Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr Glu 
225 230 235 240 

lie Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp Gly 

24 5 2 50 2 55 

Ara He Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His Asn 

260 265 270 

Ala Gin Phe Asp Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser Arg 
275 280 285 

Ala Thr Pro Leu Leu Asp Leu lie Lys Thr Ala Leu Thr Pro His Pro 
290 295 300 

Pro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu Phe 
305 310 315 320 



Page 134 



WO 2005/096804 



PCT/US2004/007182 



lie Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu Glu 

325 330 335 

Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly Gly 

340 345 350 

Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin Trp 
355 360 365 

He Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp Lys 
370 375 380 

Thr Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr Leu 
385 390 395 400 

Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala Gly 

405 " 410 415 

Phe Thr Gin lie Val Asn Glu Ala Arg He Pro Ala Cys Ser Leu Ser 

420 425 430 



Glu Lys Asp Glu Leu 
4 35 
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